References
April 26, 2024

Visual grounding

Using the source input as ground truth will help trust the system and makes it easy to interpret its process and what might have gone wrong.

A screenshot of a pricing table or spreadsheet. The main focus is a highlighted cell displaying the amount "€ 35,00". This same amount is also shown in a separate box or label within the image. The surrounding cells contain various other monetary amounts ranging from around €27 to €165. Based on the layout and formatting, this seems to be a financial or pricing-related document.
Human needs

When checking data, I want to be able to see how the system arrived at its answer, so I can trust the data and identify any potential errors in the process.

Considerations
  • AI Transparency and Explainability: Make AI systems transparent and understandable by explaining how and why decisions are made.
  • Multimodal Context: In this example we used the context of an image of a receipt, but it can also include other modalities, such as audio.
Explore Further

More of the Witlist

Witlist
Apr 2024
Realtime prompt feedback

Guide users to understand what makes a good prompt will help them learn how to craft prompts that result in better outputs.

Witlist
Jun 2024
Describe processes

AI actions often take time to complete. To improve user experience, use descriptions of what is happening combined with basic animations that represent different types of actions.

Witlist
May 2024
Proactive agents

Proactive agents can autonomously initiate conversations and actions based on previous interactions and context providing timely and relevant assistance.

Witlist
Jul 2024
Generate quick replies

An intelligent assistant that analyzes emails to identify questions and feedback requests, providing pre-generated response options and converting them into complete and contextually appropriate replies.

Aug 2024
Realtime voice interfaces

Voice interfaces should dynamically adapt to user interruptions, seamlessly incorporating them into the conversation ensuring a fluid and responsive dialogue.

Midjourney
May 2024
Incentivize giving feedback

Presenting multiple outputs helps users explore and identify their preferences and provides valuable insights into their choices, even enabling user feedback for model improvement.