Visual grounding
Using the source input as ground truth helps users trust the system and makes it easy to interpret its process and see what might have gone wrong.
When checking data, I want to be able to see how the system arrived at its answer, so I can trust the data and identify any potential errors in the process.
- AI Transparency and Explainability: Make AI systems transparent and understandable by explaining how and why decisions are made.
- Multimodal Context: In this example, the context is an image of a receipt, but context can also come from other modalities, such as audio.
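One way to picture the receipt example is as data: each extracted value carries a pointer back to the part of the source image it was read from, so an interface can highlight that region when the user inspects the value. The sketch below is a minimal illustration under stated assumptions, not the implementation behind this pattern. The names `Region`, `GroundedField`, and `field_at`, the pixel-based bounding boxes, and the sample receipt values are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Region:
    """Bounding box in source-image pixels (hypothetical layout)."""
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class GroundedField:
    """One extracted value plus the image region it was read from."""
    name: str
    value: str
    source: Region


def field_at(fields: Sequence[GroundedField], px: int, py: int) -> Optional[GroundedField]:
    """Return the first field whose source region contains the point, or None."""
    for f in fields:
        r = f.source
        if r.x <= px < r.x + r.width and r.y <= py < r.y + r.height:
            return f
    return None


# Made-up sample data standing in for an extraction result.
receipt = [
    GroundedField("merchant", "Corner Cafe", Region(40, 20, 220, 30)),
    GroundedField("total", "12.50", Region(180, 410, 80, 24)),
]
```

With a structure like this, hovering over a point on the receipt image can look up the matching field via `field_at`, and selecting a field in the results can highlight its `source` region on the image.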
More of the Witlist
Voice interfaces should dynamically adapt to user interruptions, seamlessly incorporating them into the conversation to ensure a fluid and responsive dialogue.
Input design concepts in small bits and see the cumulative output in real time. Explore different combinations and immediately visualize the results, making the creative process interactive and flexible.
A smart browser assistant that understands the context of your open tabs to offer relevant suggestions and actions, enhancing productivity through transparency and control.
Generative AI can provide custom types of input beyond just text, like generated UI elements, to enhance user interaction.
AI excels at classifying vast amounts of content, presenting an opportunity for new, more fluid filter interfaces tailored to the content.
Ordering content along different interpretable dimensions, like style or similarity, makes it navigable on x and y axes, facilitating exploration and discovery of relationships within the data.