VozLingua is an AI-powered voice translation application that converts audio in Spanish to text and translates it into English, Italian, French, and Japanese. It utilizes advanced voice recognition and audio synthesis technology to facilitate communication across language barriers.

felipesanchez-dev/InterVoz-

Voice Translator

Description

This project uses artificial intelligence to transcribe an audio file in Spanish, translate it into several languages (English, Italian, French, and Japanese), and convert the translations into audio files. It is built using Gradio, Whisper, and the ElevenLabs API.

Setup

Obtain your ElevenLabs API key and replace "Your ElevenLabs API key here" in the file with the key generated in your ElevenLabs account.
Ensure you have the Whisper model and the necessary libraries installed.

Functionality

The main workflow of the application is as follows:

  • Transcription: The user records or uploads an audio file in Spanish.
  • Translation: The transcribed text is translated into English, Italian, French, and Japanese.
  • Audio Conversion: The translations are converted into audio files and returned to the user.
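
The three steps above can be sketched as a small pipeline. This is only an illustration of the flow, not the project's actual code: `run_pipeline`, `TARGET_LANGUAGES`, and the three callables passed in are hypothetical names, and the real transcription, translation, and speech synthesis calls (Whisper, the translate package, ElevenLabs) are assumed to be supplied by the caller.

```python
from typing import Callable, Dict

# Target languages named in this README, keyed by ISO 639-1 code.
# The mapping itself is an assumption made for this sketch.
TARGET_LANGUAGES = {"en": "English", "it": "Italian", "fr": "French", "ja": "Japanese"}


def run_pipeline(
    audio_path: str,
    transcribe: Callable[[str], str],
    translate: Callable[[str, str], str],
    synthesize: Callable[[str, str], str],
) -> Dict[str, str]:
    """Transcribe Spanish audio, translate it, and return one audio path per language."""
    # Step 1: transcription (Spanish speech to Spanish text).
    spanish_text = transcribe(audio_path)

    outputs = {}
    for code in TARGET_LANGUAGES:
        # Step 2: translation into the target language.
        translated = translate(spanish_text, code)
        # Step 3: audio conversion; the callable returns the saved file's path.
        outputs[code] = synthesize(translated, code)
    return outputs
```

Passing the three stages in as callables keeps the flow testable with simple stand-ins, for example `run_pipeline("sample.mp3", lambda p: "hola", lambda t, c: t, lambda t, c: f"audio_{c}.mp3")`, before any API key or model download is involved.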

Main Functions

  • translator(audio_file): Main function that handles the transcription and translation of the audio.
  • text_to_speach(text, language): Converts text to speech using the ElevenLabs API.

Virtualization Guide

If you want to isolate the project's dependencies in a Python virtual environment, follow the instructions for your operating system below.

MacOS

  1. Open your terminal.

  2. Navigate to your project directory:

    $ cd /path/to/your/project
  3. Create a virtual environment:

    $ python3 -m venv venv
  4. Activate the virtual environment:

    $ source venv/bin/activate
  5. To deactivate the virtual environment, simply run:

    $ deactivate

Windows

  1. Open your terminal.

  2. Navigate to your project directory:

    $ cd C:\path\to\your\project
  3. Create a virtual environment:

    $ python -m venv venv
  4. Activate the virtual environment:

    $ venv\Scripts\activate
  5. To deactivate the virtual environment, simply run:

    $ deactivate

Linux

  1. Open your terminal.

  2. Navigate to your project directory:

    $ cd /path/to/your/project
  3. Create a virtual environment:

    $ python3 -m venv venv
  4. Activate the virtual environment:

    $ source venv/bin/activate
  5. To deactivate the virtual environment, simply run:

    $ deactivate
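
To confirm that activation worked on any of the three systems, you can ask the interpreter itself. The sketch below relies on the standard `sys.prefix` and `sys.base_prefix` attributes; the function name `in_virtualenv` is a hypothetical helper, not part of this project.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

Run it once before and once after activating the environment; the reported value should change from False to True.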

Requirements

Make sure you have the following Python packages installed:

  • gradio
  • openai-whisper (provides the whisper module)
  • translate
  • elevenlabs

Define ElevenLabs API Key

ELEVENLABS_API_KEY = "Your ElevenLabs API key here"
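
As an alternative to hardcoding the key, it can be read from an environment variable so that it never ends up in version control. This is a minimal sketch, assuming you export a variable named `ELEVENLABS_API_KEY` yourself; the helper `load_api_key` is hypothetical and not part of this project.

```python
import os


def load_api_key(env_var: str = "ELEVENLABS_API_KEY") -> str:
    """Return the API key stored in the given environment variable."""
    # Strip surrounding whitespace that often sneaks in when copying a key.
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable before running the app.")
    return key
```

With this helper, the assignment above could become `ELEVENLABS_API_KEY = load_api_key()`.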

You can install the dependencies using "pip":

$ pip install gradio openai-whisper translate elevenlabs
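
After installing, a quick check can confirm that each dependency is importable. The sketch below uses only the standard library; `REQUIRED_MODULES` and `missing_modules` are hypothetical names, and the list assumes the import names match the package names in the Requirements list.

```python
from importlib.util import find_spec

# Import names for the packages listed under Requirements (assumed).
REQUIRED_MODULES = ["gradio", "whisper", "translate", "elevenlabs"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found on the import path."""
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    print("missing:", ", ".join(missing) if missing else "none")
```

The check only locates the modules without importing them, so it stays fast even though Whisper and Gradio are heavy to load.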

CONTRIBUTIONS

Contributions are welcome. Feel free to open an issue or submit a pull request.

LICENSE

This project is licensed under the MIT License. See the LICENSE file for more details.

CONTACT ME
