The Seeing AI project is an Android TTS (Text-to-Speech) OCR (Optical Character Recognition) converter system designed to assist individuals with visual impairments by converting printed text into spoken words. The application addresses the growing need for accessible reading solutions as the number of visually impaired individuals increases.
System Functionality
The system operates by capturing images through the device's camera. It employs the Google Cloud Vision API for OCR, which accurately detects and extracts text from the captured images. Once the text is identified, it is converted into speech using TTS technology, allowing users to listen to the content through the device's speaker or headphones.
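The capture, recognize, and speak flow described above can be sketched in plain Java. This is a minimal illustration, not the project's actual code: `TextRecognizer`, `Speaker`, and `ReadAloudPipeline` are hypothetical names, and the "No text detected." fallback is an assumption. The request builder shows the JSON body shape that the Cloud Vision `images:annotate` REST endpoint accepts for `TEXT_DETECTION`; authentication and the HTTP call itself are left out.

```java
import java.util.Base64;

/** Hypothetical stand-in for the OCR step (the article uses the Google Cloud Vision API). */
interface TextRecognizer {
    String recognize(byte[] imageBytes);
}

/** Hypothetical stand-in for the speech step (the article uses Android's built-in TTS). */
interface Speaker {
    void speak(String text);
}

/** Sketch of the image -> text -> speech flow; all names here are illustrative. */
final class ReadAloudPipeline {
    private final TextRecognizer recognizer;
    private final Speaker speaker;
    private String lastText = "";

    ReadAloudPipeline(TextRecognizer recognizer, Speaker speaker) {
        this.recognizer = recognizer;
        this.speaker = speaker;
    }

    /** Runs OCR on a captured image, speaks the result, and returns the spoken text. */
    String process(byte[] imageBytes) {
        String text = recognizer.recognize(imageBytes);
        // Assumed fallback so the user always gets audible feedback.
        lastText = (text == null || text.trim().isEmpty())
                ? "No text detected."
                : text.trim();
        speaker.speak(lastText);
        return lastText;
    }

    /** Speaks the most recently recognized text again, if there is any. */
    void repeat() {
        if (!lastText.isEmpty()) {
            speaker.speak(lastText);
        }
    }

    /** Builds the JSON body for a Cloud Vision images:annotate TEXT_DETECTION request. */
    static String buildVisionRequest(byte[] imageBytes) {
        String base64 = Base64.getEncoder().encodeToString(imageBytes);
        return "{\"requests\":[{\"image\":{\"content\":\"" + base64 + "\"},"
                + "\"features\":[{\"type\":\"TEXT_DETECTION\"}]}]}";
    }
}
```

In an Android app, the `Speaker` role would typically be filled by `android.speech.tts.TextToSpeech`, and the `TextRecognizer` role by a network call that posts the request body above and reads the detected text from the response. Keeping both behind small interfaces lets the flow be tested without a device.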
Key Features
- User-Friendly Interface: The application is designed for simplicity, allowing navigation using the device's volume buttons. For instance:
  - Pressing the volume down button once repeats the text.
  - Pressing it twice captures a new image.
  - Pressing the volume up button exits the application.
- Versatile Text Recognition: The system can read various types of text, including printed documents, digital displays, and signboards, providing on-the-go assistance.
- Technology Stack: Developed using Java and Android Studio, the front end is created with XML, while the back end utilizes the Google Cloud Vision API for OCR and the built-in TTS capabilities of Android devices.
- Increased Independence: The system empowers visually impaired individuals to interact more freely with their environment by enabling them to read texts that would otherwise be inaccessible.
- Portable and Convenient: As a mobile application, it can be used anywhere, making it a practical solution for daily challenges faced by visually impaired users.
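The volume-button navigation listed under the User-Friendly Interface feature requires telling a single press from a double press. One way to do this, sketched below, is to wait a short window after the first volume-down press before deciding. This is an illustrative sketch only: `VolumeKeyNavigator`, `NavAction`, and the 400 ms window are assumptions, not details taken from the project.

```java
/** Actions the volume keys can trigger, as described in the feature list. */
enum NavAction { REPEAT_TEXT, CAPTURE_IMAGE, EXIT_APP, NONE }

/** Hypothetical single/double press classifier driven by caller-supplied timestamps. */
final class VolumeKeyNavigator {
    /** Assumed maximum gap between two presses that still counts as a double press. */
    static final long DOUBLE_PRESS_WINDOW_MS = 400;

    /** Time of an unresolved volume-down press, or -1 if none is pending. */
    private long pendingDownAt = -1;

    /** Volume up exits immediately and discards any pending volume-down press. */
    NavAction onVolumeUp() {
        pendingDownAt = -1;
        return NavAction.EXIT_APP;
    }

    /**
     * Handles a volume-down press. A second press inside the window captures a new
     * image; otherwise the press is stored and the decision is deferred to onTick.
     * A stale pending press that was never resolved by onTick is simply replaced.
     */
    NavAction onVolumeDown(long nowMs) {
        if (pendingDownAt >= 0 && nowMs - pendingDownAt <= DOUBLE_PRESS_WINDOW_MS) {
            pendingDownAt = -1;
            return NavAction.CAPTURE_IMAGE;
        }
        pendingDownAt = nowMs;
        return NavAction.NONE;
    }

    /** Resolves a pending single press as REPEAT_TEXT once the window has elapsed. */
    NavAction onTick(long nowMs) {
        if (pendingDownAt >= 0 && nowMs - pendingDownAt > DOUBLE_PRESS_WINDOW_MS) {
            pendingDownAt = -1;
            return NavAction.REPEAT_TEXT;
        }
        return NavAction.NONE;
    }
}
```

On Android, such a classifier would likely be driven from an activity's `onKeyDown` callback, checking for `KeyEvent.KEYCODE_VOLUME_DOWN` and `KeyEvent.KEYCODE_VOLUME_UP`, with `onTick` triggered by a delayed callback. The trade-off of this design is that a single press is only acted on after the window closes, so the repeat action is slightly delayed.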
Despite its advantages, the system has some limitations:
- Image Capture Quality: If the image is not captured properly, the OCR process may yield inaccurate results.
- Volume Button Navigation: While intuitive for some, the volume button feature may be confusing for others, potentially hindering usability.
In conclusion, the Seeing AI project represents a significant advancement in assistive technology for visually impaired individuals. By transforming text into speech, it enhances access to information and fosters greater independence, ultimately improving the quality of life for users.