Our team TTS Model Engineer and contribute to advancing the quality of neural voice synthesis in GigaChat. You will work on core model improvements, multimodal integration, and innovative research projects that push the boundaries of speech technology.
Key Responsibilities:
- Develop and optimize TTS models to enhance performance and naturalness.
- Benchmark and surpass industry standards in speech synthesis quality.
- Research and implement novel techniques in deep learning for audio, including voice cloning, low-resource adaptation, and reinforcement learning.
- Collaborate on cross-functional projects such as training acceleration and multimodal AI integration.
- Share expertise through internal seminars, technical publications, and community engagement (e.g., Habr, Telegram).
Qualifications & Skills:
- Proficiency in Python and C++, with a solid foundation in algorithms and applied mathematics.
- Expertise in Deep Learning, particularly in audio/speech-related models.
- Hands-on experience in training and deploying production-grade ML models.
- Broad interdisciplinary knowledge (e.g., NLP, linguistics, Russian language, physics/biology of speech).
- A track record of publications (conferences, journals, or preprints) is a plus.
We Offer:
- Flexible work arrangements: Hybrid or fully remote options.
- Competitive compensation: Annual salary reviews and performance-based bonuses.
- Professional growth: Access to 400+ courses via SberUniversity for continuous learning.
- Health & wellness: Extended insurance (DMS), family coverage, corporate pension plan, and gym/relaxation facilities.
- Exclusive benefits:
- Mortgage support (up to 7% preferential rate).
- SberPrime+ subscription and partner discounts.
Москва
Не указана
Москва
Не указана