Show HN: Lanturn – A smart headlamp running voice+vision on ESP32
github.com·5h·
Discuss: Hacker News
Flag this post

Lanturn 🔦 (Work in Progress)

A hackathon project connecting the Gemini Live API to an ESP32 Atoms3r-CAM device for voice + vision conversations on embedded hardware.

Overview

Lantern demonstrates real-time AI voice conversations with vision running on an ESP32 microcontroller. It uses:

  • ESP32 Atoms3r‑CAM (mic, speaker, camera)
  • Pipecat (voice/media orchestration)
  • Gemini Live API (multimodal speech + vision)

Features

  • ✅ Real-time voice conversations
  • Real-time vision processing with camera (Work in Progress)
  • ✅ Gemini Live multimodal AI integration
  • ✅ ESP32 hardware support
  • ✅ WebRTC audio + vision streaming
  • ✅ Automatic greeting on connection
  • ✅ Google Search tool call
  • AI can see and describe what the camera shows

Architecture

Similar Posts

Loading similar posts...