Difference between revisions of "Human Computer Interaction"

From GISAXS
Jump to: navigation, search
(AI Computer Use)
(Smart Glasses)
 
(16 intermediate revisions by the same user not shown)
Line 12: Line 12:
 
* [https://www.shop.iyo.audio/shop/p/iyo-one Iyo One earbuds] ([https://techcrunch.com/2024/05/27/iyo-thinks-its-gen-ai-earbuds-can-succeed-where-humane-and-rabbit-stumbled/ $600])
 
* [https://www.shop.iyo.audio/shop/p/iyo-one Iyo One earbuds] ([https://techcrunch.com/2024/05/27/iyo-thinks-its-gen-ai-earbuds-can-succeed-where-humane-and-rabbit-stumbled/ $600])
 
* [https://www.omi.me/ Omi] [https://github.com/BasedHardware/Omi open-source] wearable ([https://www.omi.me/checkouts/cn/Z2NwLXVzLWVhc3QxOjAxSkgzQkFLSjBWUDUzNEdWTVBDWUQ5WTUz?cart_link_id=i3ho9H7P $90])
 
* [https://www.omi.me/ Omi] [https://github.com/BasedHardware/Omi open-source] wearable ([https://www.omi.me/checkouts/cn/Z2NwLXVzLWVhc3QxOjAxSkgzQkFLSjBWUDUzNEdWTVBDWUQ5WTUz?cart_link_id=i3ho9H7P $90])
 +
* 2025-08: [https://memories.ai/luci/ Luci Pin]
 +
* 2025-08: [https://www.tainecklace.com/ Tai Necklace]
 +
* 2025-09: [https://friend.com/ Friend]
  
 
==Smart Glasses==
 
==Smart Glasses==
Line 27: Line 30:
 
* [https://solosglasses.com/ Solos] AirGo V smart glasses, [https://www.theverge.com/2024/12/10/24317805/solos-airgo-vision-chatgpt-ai-smart-glasses-price-availability with] vision/camera, ChatGPT integration
 
* [https://solosglasses.com/ Solos] AirGo V smart glasses, [https://www.theverge.com/2024/12/10/24317805/solos-airgo-vision-chatgpt-ai-smart-glasses-price-availability with] vision/camera, ChatGPT integration
 
* [https://raven.computer/ Raven Resonance]
 
* [https://raven.computer/ Raven Resonance]
 +
* [https://hallidayglobal.com/ Halliday] ($430)
 +
* Meta [https://www.projectaria.com/ Aria] Gen 2
 +
* 2025-06: [https://www.roadtovr.com/xiaomi-ai-glasses-meta-smart-glasses-features/ Xiaomi AI Glasses]
 +
* 2025-08: [https://www.microled-info.com/guangyu-gaowei-announces-coray-air2-glasses-full-color-microled-display-engine Guangyu Gaowei Coray Air2 glasses] (microLED)
 +
* 2025-09: [https://about.fb.com/news/2025/09/meta-ray-ban-display-ai-glasses-emg-wristband/ Meta Ray-Ban Display: AI Glasses With an EMG Wristband] ($800)
 +
 +
==VRs==
 +
* [https://www.bigscreenvr.com/ Bigscreen Beyond 2]
  
 
=UIs tailored to AI=
 
=UIs tailored to AI=
Line 57: Line 68:
 
* [https://github.com/browserbase/stagehand stagehand]: An AI web browsing framework focused on simplicity and extensibility
 
* [https://github.com/browserbase/stagehand stagehand]: An AI web browsing framework focused on simplicity and extensibility
 
* [https://github.com/Skyvern-AI/skyvern Skyvern]: Automate Browser-based workflows using LLMs and Computer Vision
 
* [https://github.com/Skyvern-AI/skyvern Skyvern]: Automate Browser-based workflows using LLMs and Computer Vision
 +
* Amazon [https://labs.amazon.science/blog/nova-act Nova Act]
 +
* [https://github.com/browserable/browserable Browserable]: Open source browser automation library for AI agents
  
 
==Full Desktop GUI==
 
==Full Desktop GUI==
Line 65: Line 78:
 
** [https://arxiv.org/abs/2501.12326 UI-TARS: Pioneering Automated GUI Interaction with Native Agents] ([https://github.com/bytedance/UI-TARS code])
 
** [https://arxiv.org/abs/2501.12326 UI-TARS: Pioneering Automated GUI Interaction with Native Agents] ([https://github.com/bytedance/UI-TARS code])
 
* [https://manus.im/ Manus AI]
 
* [https://manus.im/ Manus AI]
 +
* [https://github.com/langmanus/langmanus LangManus]: open-source agent (based on LangChain and LangGraph)
 
* [https://github.com/camel-ai/owl OWL (Optimized Workforce Learning)]: General Multi-Agent Assistance in Real-World Task Automation
 
* [https://github.com/camel-ai/owl OWL (Optimized Workforce Learning)]: General Multi-Agent Assistance in Real-World Task Automation
 
* OpenAI [https://platform.openai.com/docs/api-reference/responses responses API] and [https://platform.openai.com/docs/guides/agents agents SDK]
 
* OpenAI [https://platform.openai.com/docs/api-reference/responses responses API] and [https://platform.openai.com/docs/guides/agents agents SDK]
 +
* [https://github.com/bytebot-ai/bytebot?tab=readme-ov-file bytebot]: The computer use container
  
 
==Screen Record==
 
==Screen Record==
Line 75: Line 90:
 
* OpenAI [https://openai.com/index/introducing-operator/ Operator]
 
* OpenAI [https://openai.com/index/introducing-operator/ Operator]
 
* [https://convergence.ai/ Convergence AI] [https://proxy.convergence.ai/ Proxy] ([https://x.com/ai_for_success/status/1883824921396322493 examples])
 
* [https://convergence.ai/ Convergence AI] [https://proxy.convergence.ai/ Proxy] ([https://x.com/ai_for_success/status/1883824921396322493 examples])
 +
* [https://computer-agent.ai/ screenpipe Computer Agent]
 +
* [https://generalagents.com/ General Agents Co.] [https://generalagents.com/ace/ Ace]

Latest revision as of 12:19, 18 September 2025

A.k.a. HCI

Smart Wearables

Pendants, etc.

Smart Glasses

VRs

UIs tailored to AI

Example products with AI-first interfaces

  • Thread of examples
    • granola: AI notepad for meetings.
    • attio: A next-generation CRM platform that leverages AI to automate complex go-to-market tasks and enhance customer relationship management.
    • rabbitholes.ai: An AI-powered platform that facilitates deep, explorative conversations on an infinite canvas, enabling users to learn faster and delve deeper into topics.
    • tldraw.com: A free, instant collaborative whiteboarding tool for creating diagrams, flowcharts, and sketches with real-time collaboration.
    • herostuff.com: An AI-driven marketplace that allows users to scan, price, and list items for sale quickly using AI technology.
    • krea.ai: A platform that simplifies generative AI, enabling users to create and enhance images and videos for free.
    • superrandom.studio/venngenn: An AI tool that generates unique and creative images based on user prompts, offering various style and environment modifiers.
    • scrapybara.com/playground: An experimental AI-powered playground that allows users to interact with AI agents for various tasks, including web scraping and data extraction.
    • sdk.vercel.ai: An open-source AI SDK for TypeScript that provides tools to build AI-powered products, supporting multiple AI providers and frameworks.
    • midday.ai: A business management platform designed for freelancers, offering features like invoicing, time tracking, financial overviews, and an AI assistant.
    • dupe.com: A platform that helps users find similar products at lower prices, aiming to provide affordable alternatives to popular items.
  • Granola: The AI notepad for people in back-to-back meetings

AI Computer Use

Research

Browser

  • Helium: Light-weight web automation with Python; library for automating browsers (Chrome, Firefox) actions (enter values, click buttons, etc.)
  • Browser-Use (app)
  • Lightpanda Browser: open-source browser made for headless usage
  • Steel: The open-source browser API for AI agents & apps (build live web agents and browser automation tools)
  • stagehand: An AI web browsing framework focused on simplicity and extensibility
  • Skyvern: Automate Browser-based workflows using LLMs and Computer Vision
  • Amazon Nova Act
  • Browserable: Open source browser automation library for AI agents

Full Desktop GUI

Screen Record

Computer Use Agents