Researcher [LLM, VLM, and Omni-Model Training] (MWS AI)

МТС

Researcher [LLM, VLM, and Omni-Model Training] (MWS AI)

Москва, Подсосенский переулок, 23с5

Метро: Курская

Описание вакансии

MWS AI is a part of the MTS Web Services ecosystem where future AI solutions are created. Voice assistants and chat bots can answer any question of the client, and computer vision - recognize faces and emotions. We brought together strong developers, the most powerful supercomputer, and breakthrough ideas to make people’s lives more comfortable and safe and help business.

We are seeking a Researcher to drive the development and training of Large Language Models (LLMs), Vision-Language Models (VLMs), and omni-modal models. to help develop and train Large Language Models (LLMs), Vision-Language Models (VLMs), and omni-modal models. The successful candidate will work at the intersection of research and practical systems, contributing to model design, experimentation, and scalable training pipelines under the guidance of more senior researchers and engineers.

This role is ideal for candidates who already have some hands-on experience with deep learning and are motivated to grow into large-scale LLM/VLM research and engineering.

Key Responsibilities:
  • Contribute to the design, training, and evaluation of LLMs, VLMs, and omni-modal models
  • Run experiments, analyze results, and help iterate on model architectures and training recipes
  • Assist in developing and optimizing training and inference pipelines for latency, throughput, and cost efficiency
  • Help implement and maintain distributed training workflows on multi-GPU and (over time) multi-node clusters
  • Collaborate with engineers and product teams to transfer research outcomes into real-world applications and products
  • Contribute to writing internal reports, technical documentation, and, when appropriate, research papers
  • Stay current with the latest advances in machine learning, deep learning, and large-scale model training
Required Qualifications:
  • Bachelor’s degree (completed or in final year) or Master’s degree in Computer Science, Electrical Engineering, Mathematics, or a related field, or equivalent practical experience
  • Good oral and written English at a minimum level of B1-В2
  • Solid background in mathematics, including linear algebra, probability, statistics, and optimization
  • Strong programming skills in Python, with experience in deep learning frameworks such as PyTorch (preferably), TensorFlow, or JAX
  • Practical experience training deep neural networks (e.g., for NLP, computer vision, recommendation, or related tasks), including data preprocessing, model implementation, and debugging
  • Familiarity with core machine learning concepts (overfitting, regularization, evaluation metrics, cross-validation, etc.)
  • Ability to read and implement methods from recent ML research papers
  • Ability to work both independently and collaboratively in a fast-paced research and engineering environment
  • Good communication skills, including the ability to clearly present experimental results and technical ideas
Beneficial Qualifications:
  • Hands-on experience with training or fine-tuning LLMs, VLMs, or other large foundation models (e.g., using HuggingFace, LLaMA Factory, or similar frameworks)
  • Familiarity with distributed and parallel training concepts (e.g., data/model parallelism, mixed-precision training, gradient accumulation, checkpoints)
  • Experience using job schedulers or orchestration tools (e.g., SLURM, Kubernetes, Ray) in any setting (uni-versity, research lab, or industry)
  • Exposure to model efficiency techniques such as quantization, pruning, distillation, low-rank methods, or sparsity
  • Experience contributing to open-source ML projects or maintaining reproducible ML codebases
  • Experience building and deploying smaller-scale ML models into real products, services, or internal tools

Extra Beneficial Qualifications:

  • Ph.D. in Computer Science, Electrical Engineering, Mathematics, or a related field with a focus on machine learning or deep learning
  • Prior research or engineering experience specifically focused on large-scale LLMs, VLMs, or omni-modal models (pre-training, alignment, evaluation, or compression)
  • Deep hands-on experience with distributed training on multi-GPU or multi-node clusters (e.g., FSDP, Deep-Speed, ZeRO, pipeline or tensor parallelism)
  • Strong track record of work on model compression, acceleration, or efficient inference for large models (e.g., production-grade quantization, sparsity, distillation, or low-rank adaptation)
  • Publications at top-tier conferences such as NeurIPS, ICML, ICLR, CVPR, ICCV, ACL, EMNLP, or similar venues, or strong evidence of research impact (e.g., widely used open-source projects)
  • Experience leading ML projects end-to-end, from problem definition and data collection to deployment and monitoring

What will you get from us:

  • 5/2, partial or full remote working scheduling, flexible start of the day
  • Fixed monthly salary + quarterly bonuses
  • Comfortable office 5 minutes walk from the metro station (Moscow)
  • Medical insurance with dentistry, travel insurance, life insurance, discounts on car insurance, etc.
  • Participation in conferences and meetups, trainings at the expense of the company
  • Free mobile communications
  • Annual paid leave - 31 days
  • Special offers and discounts from MTS partners
Посмотреть контакты работодателя

Адрес

Похожие вакансии

Институт искусственного интеллекта AIRI

Исследователь (Research Scientist) в области RL

Институт искусственного интеллекта AIRI

  • Москва

  • Не указана

Рекомендуем
СБЕР
  • Москва

  • Не указана

Рекомендуем
СБЕР
  • Москва

  • Не указана

Рекомендуем
  • Москва

  • от 3000 USD

AI - инженер

Онивс Рус

  • Москва

  • от 3000 USD

Axenix (ранее Accenture)

Аналитик Data & AI Strategy

Axenix (ранее Accenture)

  • Москва

  • от 3000 USD

билайн

AI Product Manager

билайн

  • Москва

  • от 3000 USD

СБЕР
  • Москва

  • от 3000 USD

ИЦ АЙ-ТЕКО

AI Agent Developer (Python)

ИЦ АЙ-ТЕКО

  • Москва

  • от 3000 USD

СБЕР
  • Москва

  • от 3000 USD

ИЦ АЙ-ТЕКО

AI Agent Developer

ИЦ АЙ-ТЕКО

  • Москва

  • от 3000 USD

МТС
  • Москва

  • от 3000 USD

Хотите оставить вакансию?

Заполните форму и найдите сотрудника всего за несколько минут.
Оставить вакансию