Multilingual and Multi-Speaker Generative Dubbing
Kineton srl · Naples, IT
Job description
Lo scopo principale è creare una pipeline di doppiaggio generativo che mantenga il timbro originale degli speaker, traducendo e ricostruendo l’audio in un’altra lingua con sincronizzazione labiale di base.
La tesi comprende diarizzazione e trascrizione del parlato, traduzione, allineamento tempo-fonema e rendering finale con TTS.
Argomento principale: AI/ML, IA generativa, ASR, TTS neurale, Speech Translation.
Corso di studio e requisiti candidato: Informatica, Ingegneria Informatica. Solida base in Python e ML, librerie PyTorch/TensorFlow; gradite competenze su dataset audio.
Sede tirocinio: Napoli e Milano.
___________
The main goal is to create a generative dubbing pipeline that maintains the original speaker’s timbre, translating and reconstructing the audio in another language with basic lip synchronisation.
The thesis includes speech diarization and transcription, translation, time- phoneme alignment, and final rendering with TTS.
Main Topic: AI/ML, Generative AI, ASR, Neural TTS, Speech Translation.
Course of Study and Candidate Requirements: Computer Science or Computer Engineering. Solid foundation in Python and ML, PyTorch/TensorFlow libraries; skills with audio datasets are a plus.
Internship Location: Naples and Milan.
ML/AI Work links you to the employer's original posting — always verify the details there before applying.
More Core AI Engineering roles
View all →AI Engineer
Visiotech España · Remote · Madrid
Software R&D Engineer, RTL Optimization Tools
NVIDIA · Austin, US
Staff AI Engineer
Acquia · Remote · Boston
Senior Developer Relations Manager — Security and AI Software
NVIDIA · Oakland, US
AI Developer
Amana Living · Perth, AU
Senior Full-Stack Developer (AI/LLM)
NOUS SRL · Bari, IT