References
April 26, 2024

Visual grounding

Using the source input as ground truth helps people trust the system and makes it easier to interpret its process and see what might have gone wrong.

A screenshot of a pricing table or spreadsheet. The main focus is a highlighted cell displaying the amount "€ 35,00". This same amount is also shown in a separate box or label within the image. The surrounding cells contain various other monetary amounts ranging from around €27 to €165. Based on the layout and formatting, this seems to be a financial or pricing-related document.
Human needs

When checking data, I want to be able to see how the system arrived at its answer, so I can trust the data and identify any potential errors in the process.

Considerations
  • AI Transparency and Explainability: Make AI systems transparent and understandable by explaining how and why decisions are made.
  • Multimodal Context: In this example, the context is an image of a receipt, but context can also come from other modalities, such as audio.
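
One way to support this pattern is to have every extracted value carry a reference to the region of the source it was read from, so the interface can highlight that region next to the answer. The sketch below is a minimal illustration under that assumption. The names SourceRegion, GroundedValue, and highlight_box, and the pixel coordinates, are hypothetical and do not come from any particular product or library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceRegion:
    """Hypothetical bounding box in source-image pixel coordinates."""
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class GroundedValue:
    """An extracted value that keeps a pointer back to where it was read."""
    label: str
    text: str
    region: SourceRegion


def highlight_box(value: GroundedValue, padding: int = 4) -> tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of the rectangle to draw over the source."""
    r = value.region
    return (
        max(r.x - padding, 0),  # clamp so the box never leaves the image
        max(r.y - padding, 0),
        r.x + r.width + padding,
        r.y + r.height + padding,
    )


# Illustrative data only: the amount from the example and a made-up location.
amount = GroundedValue(
    label="amount",
    text="€ 35,00",
    region=SourceRegion(x=120, y=48, width=64, height=18),
)
print(highlight_box(amount))  # (116, 44, 188, 70)
```

Keeping the region alongside the value means the highlight can be drawn without asking the system a second time, and a wrong answer can be traced back to the exact cell it was read from.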
Explore Further

More of the Witlist

OpenAI
May 2024
Conversation starters

Starting with a blank canvas can be intimidating, but providing prompt starters can help individuals overcome this initial hurdle and jumpstart their creativity.

Perplexity
May 2024
Follow-up on an answer

Letting people select text to ask follow-up questions provides immediate, context-specific information, enhancing AI interaction and exploration.

Witlist
Jun 2024
Interactive writing partners

AI collaboration agents can act as writing partners that assist people by enhancing their content through transparent, easily understandable suggestions, while respecting the original input.

Aug 2024
Realtime voice interfaces

Voice interfaces should dynamically adapt to user interruptions, seamlessly incorporating them into the conversation to keep the dialogue fluid and responsive.

Witlist
May 2024
Spatial Prompting

Spatial prompting integrates spatial relationships into prompts, offering a novel approach to manipulating concepts. This dynamic approach can lead to more intuitive and creative outcomes.

Witlist
Jun 2024
Describe processes

AI actions often take time to complete. To improve the user experience, describe what is happening and pair the description with basic animations that represent the different types of actions.