Gertjan Burghouts, TNO, The Netherlands | Trusted Artificial Intelligence and Autonomy

Localizing military vehicles by text-image attention: A textual prompt such as "air fighter" can be correlated with image content to localize military vehicles. We do this by inspecting which image regions the AI model attends to when given the prompt. The model is CLIP, a Transformer trained on both language and images. Contributors: Dr. Gertjan Burghouts and Pieter Elands
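The idea of correlating a prompt with image content can be illustrated with a minimal sketch. This is not the contributors' implementation: instead of reading attention out of CLIP, it substitutes a simpler stand-in, cosine similarity between one text embedding and a grid of per-patch image embeddings, and takes the best-scoring patch as the location. The function names (`cosine`, `similarity_map`, `localize`) and the 3-dimensional toy embeddings are hypothetical; in practice the vectors would come from a vision-language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_map(text_emb, patch_embs):
    """Score every image patch against the text embedding.

    patch_embs is a grid (list of rows) of embedding vectors, one per patch.
    Returns a grid of the same shape holding cosine similarities.
    """
    return [[cosine(text_emb, patch) for patch in row] for row in patch_embs]


def localize(sim_map):
    """Return the (row, col) index of the highest-scoring patch."""
    cells = ((r, c) for r, row in enumerate(sim_map) for c in range(len(row)))
    return max(cells, key=lambda rc: sim_map[rc[0]][rc[1]])


# Toy data (assumption): hand-made 3-d vectors standing in for model outputs.
text_emb = [1.0, 0.0, 0.0]      # embedding of the prompt
background = [0.0, 1.0, 0.0]    # patch unrelated to the prompt
vehicle = [0.9, 0.1, 0.0]       # patch that resembles the prompt

patch_embs = [
    [background, background, background],
    [background, background, vehicle],
]

sim_map = similarity_map(text_emb, patch_embs)
location = localize(sim_map)
print(location)
```

In this toy grid only the bottom-right patch points in roughly the same direction as the prompt embedding, so it receives the highest score. A real system would upsample the resulting map to image resolution and threshold it to obtain a region rather than a single patch.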