A.I.D
🌍 Project A.I.D. (Autonomous Image Description)
Making images accessible in Indian languages - because accessibility shouldn’t be limited to English.
🎯 The Problem
450 million people in India speak Hindi or Bengali as their primary language. When visually impaired users rely on screen readers, most image descriptions are only available in English - creating a massive accessibility gap.
Current solutions:
- ❌ Image alt-text is usually in English only
- ❌ Screen readers struggle with Indian languages
- ❌ Manual translation is slow and expensive
- ❌ Existing tools don’t support Indic scripts well
💡 The Solution
Project A.I.D. uses AI to:
- 📸 Analyze any uploaded image
- 🤖 Generate detailed English descriptions using Vision-Language Models
- 🌐 T…
A.I.D
🌍 Project A.I.D. (Autonomous Image Description)
Making images accessible in Indian languages - because accessibility shouldn’t be limited to English.
🎯 The Problem
450 million people in India speak Hindi or Bengali as their primary language. When visually impaired users rely on screen readers, most image descriptions are only available in English - creating a massive accessibility gap.
Current solutions:
- ❌ Image alt-text is usually in English only
- ❌ Screen readers struggle with Indian languages
- ❌ Manual translation is slow and expensive
- ❌ Existing tools don’t support Indic scripts well
💡 The Solution
Project A.I.D. uses AI to:
- 📸 Analyze any uploaded image
- 🤖 Generate detailed English descriptions using Vision-Language Models
- 🌐 Translate descriptions into Hindi and Bengali
- 🔊 Provide text-to-speech in all three languages
Built for real people. Built for India.
✨ Features
- 🖼️ Smart Image Analysis - AI-powered image understanding
- 🌏 Multilingual Support - English, Hindi (हिंदी), and Bengali (বাংলা)
- 🔊 Text-to-Speech - Listen to descriptions in your language
- 📱 Simple Interface - Clean, accessible design
- 🚀 Fast Processing - Get results in seconds
- 🔒 Privacy First - Images processed securely, not stored
🛠️ Tech Stack
- Backend: Python, Flask
- AI/ML: Hugging Face Transformers (BLIP model)
- Translation: Google Translate API (googletrans)
- Frontend: HTML, CSS, JavaScript
- Text-to-Speech: Web Speech API
📦 Installation
Prerequisites
- Python 3.8+
- pip
Setup
- Clone the repository
git clone https://github.com/yourusername/project-aid.git
cd project-aid
- Create a virtual environment (recommended)
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies
pip install -r requirements.txt
- Set up environment variables (optional)
cp .env.example .env
# Edit .env if you want to use custom API keys
- Run the application
python app.py
- Open in browser