April 26, 2024

Visual grounding

Using the source input as ground truth helps people trust the system and makes it easier to interpret its process and see what might have gone wrong.
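One way to picture this is to keep every extracted value paired with the region of the source image it was read from, so the interface can show that region next to the answer. The sketch below is a minimal illustration under stated assumptions, not the implementation behind this example: the names `Box`, `GroundedValue`, and `highlight_region`, and the pixel coordinates, are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Pixel rectangle in the source image: left, top, right, bottom."""
    left: int
    top: int
    right: int
    bottom: int


@dataclass(frozen=True)
class GroundedValue:
    """An extracted value plus the source region it was read from (hypothetical type)."""
    label: str
    value: str
    source_box: Box


def highlight_region(box: Box, image_width: int, image_height: int, padding: int = 8) -> Box:
    """Grow the box by `padding` pixels, clamped to the image bounds,
    so the highlight shows a little context around the value."""
    return Box(
        left=max(0, box.left - padding),
        top=max(0, box.top - padding),
        right=min(image_width, box.right + padding),
        bottom=min(image_height, box.bottom + padding),
    )


# Hypothetical extraction result; the coordinates are made up for illustration.
amount = GroundedValue(label="amount", value="€ 35,00", source_box=Box(410, 220, 490, 244))
region = highlight_region(amount.source_box, image_width=500, image_height=800)
print(f"{amount.label} = {amount.value}, shown from source region {region}")
```

Keeping the source region on the value itself means the highlight can always be redrawn from the original input, which is what lets a person check the answer against what the system actually saw.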

A screenshot of a pricing table or spreadsheet. The main focus is a highlighted cell displaying the amount "€ 35,00". This same amount is also shown in a separate box or label within the image. The surrounding cells contain various other monetary amounts ranging from around €27 to €165. Based on the layout and formatting, this seems to be a financial or pricing-related document.

Human needs

When checking data, I want to be able to see how the system arrived at its answer, so I can trust the data and identify any potential errors in the process.

  • AI Transparency and Explainability: Make AI systems transparent and understandable by explaining how and why decisions are made.
  • Multimodal Context: This example uses an image of a receipt as context, but the context can also come from other modalities, such as audio.

Explore Further

More of the Witlist

May 2024
Translate in X terms

Provide relatable and engaging translations for people with varying levels of expertise, experience and ways of thinking.

Jun 2024
Navigate the space

Ordering content along different interpretable dimensions, like style or similarity, makes it navigable on x and y axes, facilitating exploration and discovery of relationships between the data.

Jun 2024
Language as a tangible material

Textual information often misses intuitive cues for understanding relationships between ideas. AI can clarify these connections, making complex information easier to grasp quickly.

Jun 2024
Semantic highlights

Embedding models can rank data based on semantic meaning, evaluating each individual segment on a spectrum to show its relevance throughout the artifact.

Jul 2024
Generate quick replies

An intelligent assistant that analyzes emails to identify questions and feedback requests, providing pre-generated response options and converting them into complete and contextually appropriate replies.

May 2024
Proactive agents

Proactive agents can autonomously initiate conversations and actions based on previous interactions and context, providing timely and relevant assistance.