Online applications are invited for the Speech Recognition Intern FY24 at SONY India. The application link will be active for a limited time only. Apply at the earliest.
About Sony Research India
Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group’s diverse businesses in electronics, entertainment, and financial fields.
For our research center to blaze a trail in the latest technologies, we seek to foster the growth of a diverse pool of engineering talent and create a technology talent bank to drive research excellence worldwide. Sony Research India is offering outstanding career opportunities around frontline technologies such as AI and data analytics.
Since its establishment in 1994, Sony India, a wholly owned subsidiary of Sony Corporation in Japan, has positioned itself as a premium brand across product categories such as television, digital imaging, personal audio, home entertainment, gaming, car audio and professional solutions.
Responsibilities
- Together with the research scientists or other team members, work on the basic development activities on a daily basis, participate in speech generation related research topics of the project, like development of Dubbing Pipeline, Text-to-Speech and Voice Cloning Systems.
Required Skills: Intern FY24 at SONY
Must-Have:
- Pytorch
- Deep Learning/Machine Learning frameworks: Pytorch, Tensorflow, scikit-learn
- Prior experience in developing speech-to-speech dubbing pipeline
- Knowledge of the state-of-the-arts in Speech-to-Speech, Text-to-Speech, Voice Cloning
- Knowledge of generative AI models
Good to have:
- Expertise in Pytorch
- Prior experience in development of Dubbing Service.
- Experience in Streamlit or FastAPI.
Education
- Ph.D. or Master from Computer Science background or bachelor’s with hands-on experience on Speech-to-Speech Dubbing Pipeline and/or Generative AI models with an applications audio/speech.
Duration
- 6 months starting January 2024 last week
- 9:00 to 18:00 (Monday to Friday) fulltime
How to Apply?
Interested candidates can directly apply through this link.
Location
Remote/Hybrid working from Bangalore, India.