We scan new podcasts and send you the top 5 insights daily.
While video and voice capabilities for AI agents are technologically impressive, user behavior shows a strong preference for traditional chat and email. Data indicates roughly 85% of users will choose text-based interaction. Offer multimodal options, but prioritize and perfect the text experience first.
Power users of AI agents believe the ideal user interface is not graphical but conversational. They prefer text-based interactions within existing chat apps and see voice as the ultimate endgame. The goal is an invisible assistant that operates autonomously and only prompts for input when absolutely necessary, making traditional UIs feel like friction.
Contrary to the trend of building elaborate dashboards to track AI agents, a simpler approach is more effective. The guest manages his agent, Larry, through simple text messages on WhatsApp, treating him like a human employee. This avoids over-engineering and keeps the interaction natural and efficient.
Don't worry if customers know they're talking to an AI. As long as the agent is helpful, provides value, and creates a smooth experience, people don't mind. In many cases, a responsive, value-adding AI is preferable to a slow or mediocre human interaction. The focus should be on quality of service, not on hiding the AI.
User expectations for AI responses change dramatically based on the input method. A spoken query demands a concise, direct answer, whereas a typed query implies the user has more patience and is receptive to a detailed, link-filled response. Contextual awareness of input modality is critical for good UX.
Observing that younger generations prefer consuming information via video (TikTok) and communicating via voice, Superhuman's CTO predicts a fundamental shift in user experience. Future interfaces, including email, will likely become more conversational and audio-based rather than relying on typing and reading.
The interface for AI agents is becoming nearly frictionless. By setting up a voice-to-voice loop via an app like Telegram, users can issue complex commands by simply holding down a button and speaking. This model removes the cognitive load of typing and makes interaction more natural and immediate.
Power users are discovering that direct, conversational interaction with AI agents is more efficient than clicking through graphical user interfaces (GUIs). This signals a shift toward an 'app-less' world where tasks are accomplished via chat, potentially making traditional UI/UX design roles redundant for many applications.
Filevine discovered that customers prefer to ask its AI assistant, Lois, questions even when the answer is displayed directly on the screen in front of them. This indicates a fundamental shift in user behavior toward conversational interfaces, making them faster and more intuitive to train and use.
Differentiate communication needs. Use automation for low-stakes messages like appointment reminders. Reserve authentic, human video for high-stakes moments where emotional connection and trust are critical, such as discussing a major purchase or sensitive personal matters.
When building feedback tools, recognize that users favor audio. It's easier for multitasking, supports multiple languages, and feels less inhibited than writing. Conversely, video feedback is highly disliked and should be avoided as a primary collection method.