Research scientist and software developer.  
Speech / video processing and generation, conversational agents, semi-supervised and unsupervised learning, private federated learning. 
Update: I am hiring strong ML engineers to work on speech and language modeling stack. I am searching for interns to join in 2026 and work on speech and language.
Industry and Research Experience
- Apple, Staff Research Scientist (Oct 2023 - present)
- Apple, Senior Research Scientist (Sep 2021 - Oct 2023)
- Fundamental AI Research, Postdoctoral Researcher (Aug 2019 - Aug 2021)
 Speech recognition and natural language processing for speech
 Advisors: Ronan Collobert, Gabriel Synnaeve
- Fundamental AI Research, AI Resident (Sep 2018 - Aug 2019)
 Speech recognition and natural language processing for speech
 Advisors: Ronan Collobert, Gabriel Synnaeve
- NTechLab, Machine Learning Expert (Aug 2017 - Sep 2018)
 Face recognition and facial attributes predictions with deep learning at top-1 face recognition team
- Yandex & CERN, Researcher (Apr 2013 - May 2017)
 Machine learning for High Energy Physics studies at the Large Hadron Collider: particle identification system, trigger system (online identification which collisions worth being stored), specific rare decays search (high-level data analysis), and B mesons oscillations (main subject of the LHCb studies)
- Membership at Large Hadron Collider beauty (LHCb) collaboration, CERN (2013 - 2018)
Education
- Ph.D. in Computer Science, Lomonosov Moscow State University (2017)
 Faculty of Computational Mathematics and Cybernetics
 Advisor: Eugene Moiseev
 Thesis: Research on solutions of non-classical boundary-value problems for mixed type equations
- M.S. in Computer Science, Yandex School of Data Analysis, 5.0/5.0 (2014)
- M.S. in Computer Science, Lomonosov Moscow State University, 5.0/5.0 (2013)
 Faculty of Computational Mathematics and Cybernetics
- Summer School on Bayesian Methods in Deep Learning (2017)
- Rome-Moscow School of Matrix Methods and Applied Linear Algebra (2012, 2013)
Software
- pfl4asr: private federated learning for speech recognition
- mlx-data: framework agnostic data loading library brought to you by Apple machine learning research; it works with PyTorch, Jax or MLX
- Flashlight: a fast, flexible machine learning library written entirely in C++
 blog post
- Wav2letter++: speech recognition toolkit and recipes for papers
- BDT reweigter tutorial
- HepML: specific machine learning tools for purposes of high energy physics
- REP: ipython-based environment for conducting data-driven research in a consistent and reproducible way
Public Talks
- Efficient Speech Generative Modeling With Little Tokenization, Summer School on Multimodal Foundation Models and Generative AI (MoroccoAI), Rabat (2025)
- Low-Latency Conversational Agent, TTIC Summer Workshop on Foundations of Speech and Audio Foundation Models, Chicago (2025)
- Private Federated Learning for Speech Recognition, FLute: Federated Learning for Audio Understanding workshop, ICASSP, Hyderabad (2025)
- Speech Generative Modeling with Little Tokenization, MIT CSAIL, Spoken Language Systems Group, Boston (2024)
- Efficient Speech Processing, Johns Hopkins University, Center for Language and Speech Processing, Baltimore (2024)
- Private Federated Learning for Speech Recognition, Apple Workshop on Privacy-Preserving Machine Learning, Cupertino (2024)
- Simple and Efficient Self-Training Approaches for Speech Recognition, Third Workshop on Efficient Natural Language and Speech Processing (ENLSP-III), NeurIPS, New Orleans (2023)
- Simple and Efficient Pseudo-Labeling for Speech Recognition, On-Device Workshop MLSys, Miami (2023)
- Machine Learning at Apple, WiML@ICML, Baltimore (2022)
- CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings, ReWork Deep Learning Summit, San Francisco (2022)
- Positional Embedding in Transformer-based Models, Higher School of Economics (2021)
- slimIPL: Language-Model-Free Iterative Pseudo-Labeling, NTR Lab and Tomsk University (2021, in Russian)
- Pseudo-Labeling for Speech Recognition, NTR Lab and Tomsk University (2021, in Russian)
- Machine Learning in Science and Industry, Heidelberg University (2017)
- LHCb Topological Trigger Optimization, Data&Science: Large Hadron Collider, public series, Yandex, Moscow (2016)
- Classifier Output Calibration to Probability, Heavy Flavour Data Mining workshop, Zurich University (2016)
- Machine Learning and Optimization of LHC Real-Time Event Stream Filter for New Physics Discoveries, Machine Learning: Prospects and Applications Conference, Berlin (2015)
Selected Publications
- Pelikan*, M., Azam*, S.S., Feldman, V., Silovsky, J., Talwar, K., Brinton, C. G., and Likhomanenko*, T. Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers, and Gradient Clipping. Thirty-Ninth Conference on Neural Information Processing Systems (NeurIPS), 2025. 
 overview, code
- Azam*, S.S., Pelikan*, M., Feldman, V., Talwar, K., Silovsky, J. and Likhomanenko*, T. Federated Learning for Speech Recognition: Revisiting Current Trends Towards Large-Scale ASR. In International Workshop on Federated Learning in the Age of Foundation Models in Conjunction with NeurIPS, 2023. Oral. 
 overview, video, slides, poster
- Azam, S.S., Likhomanenko, T., Pelikan, M. and Silovsky, J. Importance of Smoothness Induced by Optimizers in FL4ASR: Towards Understanding Federated Learning for End-to-End ASR, ASRU 2023.
- Ramapuram*, J., Danieli*, F., Dhekane*, E., Weers*, F., Busbridge*, D., Ablin*, P., Likhomanenko*, T., Digani, J., Gu, Z., Shidani, A. and Webb, R. Theory, Analysis, and Best Practices for Sigmoid Self-Attention. In International Conference on Representation Learning (ICLR), 2025. 
 code
- Busbridge*, D., Ramapuram*, J., Ablin*, P., Likhomanenko*, T., Dhekane, E.G., Suau, X. and Webb, R. How to Scale Your EMA. Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023. Spotlight. 
 overview, video, slides, poster
- Zhai*, S., Likhomanenko*, T., Littwin*, E., Busbridge*, D., Ramapuram*, J., Zhang, Y., Gu, J. and Susskind, J. Stabilizing Transformer Training by Preventing Attention Entropy Collapse. In International Conference on Machine Learning (ICML), 2023. 
 overview, video, poster, code
- Zhai, S., Jaitly, N., Ramapuram, J., Busbridge, D., Likhomanenko, T., Cheng, J.Y., Talbott, W., Huang, C., Goh, H. and Susskind, J.M. Position Prediction as an Effective Pretraining Strategy. In International Conference on Machine Learning (ICML), 2022, pp. 26010-26027. PMLR. Spotlight. 
 overview, video, poster
- Kahn, J.D., Pratap, V., Likhomanenko, T., Xu, Q., Hannun, A., Cai, J., Tomasello, P., Lee, A., Grave, E., Avidov, G., Steiner, B., Liptchinsky, V., Synnaeve, G., Collobert, R. Flashlight: Enabling Innovation in Tools for Machine Learning. In International Conference on Machine Learning (ICML), 2022, pp. 10557-10574. PMLR. (Spotlight) 
 video, presentation, poster, code
- Likhomanenko, T., Xu, Q., Synnaeve, G., Collobert, R. and Rogozhnikov, A. CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings. Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021. 
 openreview, video, presentation, code
- Rogozhnikov, A., Likhomanenko, T. InfiniteBoost: building infinite ensembles with gradient descent. arXiv preprint arXiv:1706.01109. 2017. 
 code
- Garg, S., Gheini, M., Emmanuel, C., Likhomanenko, T., Gao, Q. and Paulik, M. Generating Gender Alternatives in Machine Translation. 5th Workshop on Gender Bias in Natural Language Processing at ACL 2024.
- Likhomanenko, T., Carlson, L., Bai, R.H., Gu, Z., Tran, H., Aldeneh, Z., Zhang, Y., Zhang, R., Zheng, H. and Jaitly, N. ChipChat: Low-Latency Cascaded Conversational Agent in MLX. ASRU (demo track) 2025.
- Bai, H., Gu, Z., Likhomanenko, T., and Jaitly, N., 2025. SpeakStream: Streaming Text-to-Speech with Interleaved Data. arXiv preprint arXiv:2505.19206. 
 demo, demo-source
- Gupta, A., Likhomanenko, T., Yang, K., Bai, H., Aldeneh, Z. and Jaitly, N., 2024. Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis. arXiv preprint arXiv:2411.17690. 
 demo, demo-source
- Bai*, H., Likhomanenko*, T., Zhang, R., Gu, Z., Aldeneh, Z. and Jaitly, N., 2024. dMel: Speech Tokenization made Simple. arXiv preprint arXiv:2407.15835. 
 code, demo, demo-source
- Gu, Z., Likhomanenko, T., and Jaitly, N., 2025. Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition. ASRU 2025.
- Chi, HG., Aldeneh, Z., Likhomanenko, T., Rudovic, O., Higuchi, T., Chen, LW., Watanabe, S., Abdelaziz, AH. 2025. DiceHuBERT: Distilling HuBERT with a Self-Supervised Learning Objective. Interspeech 2025. Oral
- Chen, L.W., Higuchi, T., Bai, H., Abdelaziz, A.H., Rudnicky, A., Watanabe, S., Likhomanenko, T., Theobald, B.J. and Aldeneh, Z. Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models. ICASSP 2025.
- Aldeneh, Z., Thilak, V., Higuchi, T., Theobald, B.J. and Likhomanenko, T. Towards Automatic Assessment of Self-Supervised Speech Models using Rank. ICASSP 2025.
- Aldeneh, Z., Higuchi, T., Jung, J.W., Chen, L.W., Shum, S., Abdelaziz, A.H., Watanabe, S., Likhomanenko, T. and Theobald, B.J. Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels. ICASSP 2025.
- Gu, Z., Likhomanenko, T., Bai, H., McDermott, E., Collobert, R. and Jaitly, N., 2024. Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition. arXiv preprint arXiv:2405.15216.
- Aldeneh, Z., Higuchi, T., Jung, J.W., Seto, S., Likhomanenko, T., Shum, S., Abdelaziz, A.H., Watanabe, S. and Theobald, B.J. Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features? Interspeech 2024.
- Rouditchenko, A., Collobert, R. and Likhomanenko, T., AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition. AVGenL: Audio-Visual Generation and Learning Workshop at ECCV 2024.
- Gheini, M., Likhomanenko, T., Sperber, M. and Setiawan, H. Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data. ACL Findings, 2023. 
 overview
- Likhomanenko, T., Lugosch, L. and Collobert, R. Unsupervised ASR via Cross-Lingual Pseudo-Labeling, 2023. arXiv preprint arXiv:2305.13330.
- Berrebbi, D., Collobert, R., Jaitly, N., Likhomanenko, T. More Speaking or More Speakers? ICASSP 2023. 
 overview
- Berrebbi, D., Collobert, R., Bengio, S., Jaitly, N., Likhomanenko, T. Continuous Pseudo-Labeling from the Start. ICLR 2023. 
 overview, video, slides, poster
- Likhomanenko, T., Collobert, R., Jaitly, N., Bengio, S. Continuous Soft Pseudo-Labeling in ASR. I Can’t Believe It’s Not Better Workshop at NeurIPS 2022. 
 video, poster
- Lugosch, L., Likhomanenko, T., Synnaeve, G. and Collobert, R. Pseudo-Labeling for Massively Multilingual Speech Recognition. ICASSP 2022. 
 blog post, code
- Pratap, V., Xu, Q., Likhomanenko, T., Synnaeve, G. and Collobert, R. Word Order Does Not Matter For Speech Recognition. ICASSP 2022.
- Manohar, V., Likhomanenko, T., Xu, Q., Hsu, W.N., Collobert, R., Saraf, Y., Zweig, G. and Mohamed, A., 2021. Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition. ASRU 2021.
- Likhomanenko, T., Xu, Q., Kahn, J., Synnaeve, G. and Collobert, R. slimIPL: Language-model-free iterative pseudo-labeling. Interspeech 2021. 
 video, poster, code
- Likhomanenko*, T., Xu*, Q., Pratap*, V., Tomasello, P., Kahn, J., Avidov, G., Collobert, R. and Synnaeve, G. Rethinking evaluation in asr: Are our models robust enough? Interspeech 2021. 
 video, poster, code
- Hsu, W.N., Sriram, A., Baevski, A., Likhomanenko, T., Xu, Q., Pratap, V., Kahn, J., Lee, A., Collobert, R., Synnaeve, G. and Auli, M., 2021. Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. Interspeech 2021.
- Xu, Q., Baevski, A., Likhomanenko, T., Tomasello, P., Conneau, A., Collobert, R., Synnaeve, G. and Auli, M., 2021, June. Self-training and pre-training are complementary for speech recognition. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 3030-3034). IEEE. 
 video
- Talnikar, C., Likhomanenko, T., Collobert, R. and Synnaeve, G., 2021, June. Joint masked cpc and ctc training for asr. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 3045-3049). IEEE. 
 video, poster, presentation
- Xu, Q., Likhomanenko, T., Kahn, J., Hannun, A., Synnaeve, G. and Collobert, R., 2020. Iterative Pseudo-Labeling for Speech Recognition. Proc. Interspeech 2020, pp.1006-1010. 
 video, code
- Pratap, V., Xu, Q., Kahn, J., Avidov, G., Likhomanenko, T., Hannun, A., Liptchinsky, V., Synnaeve, G., Collobert, R. (2020) Scaling Up Online Speech Recognition Using ConvNets. Proc. Interspeech 2020, 3376-3380. 
 video, blog post, news
- Kahn, J., Rivière, M., Zheng, W., Kharitonov, E., Xu, Q., Mazaré, P.E., Karadayi, J., Liptchinsky, V., Collobert, R., Fuegen, C. and Likhomanenko, T., 2020, May. Libri-light: A benchmark for asr with limited or no supervision. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7669-7673). IEEE. 
 presentation, blog post, code
- Synnaeve*, G., Xu*, Q., Kahn*, J., Likhomanenko*, T., Grave*, E., Pratap, V., Sriram, A., Liptchinsky, V. and Collobert, R. End-to-end asr: from supervised to semi-supervised learning with modern architectures. SAS Workshop ICML 2020. 
 video, code
- Likhomanenko, T., Synnaeve, G. and Collobert, R., 2019. Who Needs Words? Lexicon-Free Speech Recognition. Proc. Interspeech 2019, pp.3915-3919. 
 presentation, blog post, code
- Derkach, D., Hushchyn, M., Likhomanenko, T., Rogozhnikov, A., Kazeev, N., Chekalina, V., Neychev, R., Kirillov, S., Ratnikov, F. and LHCb collaboration. Machine-Learning-based global particle-identifiritcation algohms at the LHCb experiment. Journal of Physics: Conference Series. 2018. Vol. 1085. No. 4. P. 1-5. 
 ACAT 2017, poster
- Likhomanenko, T., Derkach, D., Rogozhnikov, A. Inclusive Flavour Tagging Algorithm. Journal of Physics: Conference Series, 2016. 
 ACAT 2016, poster, code
- LHCb collaboration (2016). Search for decays of neutral beauty mesons into four muons, JHEP 03 (2017) 001.
- Likhomanenko, T., Ilten, P., Khairullin, E., Rogozhnikov, A., Ustyuzhanin, A., Williams, M. LHCb Topological Trigger Reoptimization. Journal of Physics: Conference Series, 2015. 
 CHEP 2015, presentation, code
- CMS collaboration, LHCb collaboration. Observation of the rare Bs0→ μ+ μ− decay from the combined analysis of CMS and LHCb data. Nature, 2015.
- Likhomanenko, T., Rogozhnikov, A., Baranov, A., Khairullin, E., & Ustyuzhanin, A. Reproducible Experiment Platform. Journal of Physics: Conference Series (Vol. 664, No. 5, p. 052022). 
 CHEP 2015, poster
- LHCb collaboration. Search for the lepton flavour violating decay τ−→ μ− μ+ μ−. Journal of High Energy Physics, 2015.
- Likhomanenko, T., Rogozhnikov, A., Baranov, A., Khairullin, E., Ustyuzhanin, A. Improving reproducibility of data science experiments, ICML 2015 AutoML Workshop, 2015 
 poster spotlight
- Moiseev, E.I., Likhomanenko, T.N. Eigenfunctions of the Gellerstedt problem with an inclined-type change line. Integral Transforms and Special Functions, 2017, pp. 1–8.
- Moiseev E. I., Likhomanenko T. N. On the basis property of a two-part trigonometric series. Doklady Mathematics, 2016, Vol. 94, No. 1, pp. 1–4. 
 oral talk, International scientific conference Actual Problems in Theory of Partial Differential Equations, dedicated to the centenary of Andrey V. Bitsadze, 2016
- Moiseev, E.I., Likhomanenko, T.N. Eigenfunctions of the Tricomi problem with an inclined type change line. Differential Equations, 2016, Vol. 52, No. 10, pp 1323– 1330. 
 oral talk, International scientific conference Actual Problems in Theory of Partial Differential Equations, dedicated to the centenary of Andrey V. Bitsadze, 2016
- Moiseev, E.I., Likhomanenko, T.N. On the basis property of a trigonometric system arising in the Frankl problem. Differential Equations, 2013, Vol. 49, No. 3, pp. 325–331. 
 oral talk, AMEE-2013 and Lomonosov-2013
- Moiseev E.I., Likhomanenko T.N. A nonlocal boundary value problem for the Lavrent’ev-Bitsadze equation. Doklady Mathematics, 2012, Vol. 86, No. 2, pp. 635–637. 
 oral talk, AMEE-2012 and Lomonosov-2012
Teaching
- DeepLearn Autumn School, Self-, Weakly-, Semi-Supervised Learning in Speech Recognition (Oct 2022)
- Heidelberg University, Grad Days, Machine learning in Science and Industry, invited lecturer (2017)
 lectures
- Imperial College London, Introduction to Machine Learning, TA (2016, 2017)
 lectures/seminars 2016, lectures/seminars 2017
- Yandex School of Data Analysis, Machine learning in High Energy Physics, lecturer (2016)
- Lund University, Summer School on Machine Learning in High Energy Physics (MLHEP), program committee & lecturer (2016)
 lectures/seminars
- Saint Petersburg Academic University, Summer School on Machine Learning in High Energy Physics (MLHEP), organizing committee & lecturer (2015)
 lectures/seminars
Research Activities
- Transactions on Machine Learning Research (TMLR) 2021-now (Expert Reviewer)
- Journal of Artificial Intelligence Research 2023
- NeurIPS 2021, 2022 (top-8% reviewer), 2023 (top-8% reviewer)
- ICLR 2021, 2022 (highlighted reviewer), 2023-2024
- ICLR Blogposts 2023, 2024
- ICML 2022-2023
- Interspeech 2020-2022, 2023 (top-2% reviewer), 2024-2025
- ICASSP 2021-2022, 2023 (outstanding reviewer), 2024-2025
- Machine Learning and the Physical Sciences workshop NeurIPS 2019-2020, 2022-2024
- SynS and ML Workshop ICML 2023
- Vision-based InduStrial InspectiON (VISION) Workshop CVPR 2023
- CHIME 2023, 2024
- BayLearn 2022-2024
- ASRU 2025
- An advisor in the LHCb statistics and machine learning working group (2016-2017)
- ICML 2024, 2025
- NeurIPS 2024, 2025
- NeurIPS Datasets and Benchmarks 2023, 2024
- Vision-based InduStrial InspectiON (VISION) Workshop CVPR 2023
- Vision-based InduStrial InspectiON (VISION) Workshop ECCV 2024
- ICASSP 2025
- TMLR Action Editor Sep 2024 - now
- Career Mentorship, Interspeech, Rotterdam (2025)
- WiML, Career Mentorship, ICML, Vancouver (2025)
- WiML, Research Mentorship, NeurIPS, New Orleans (2023)
- LatinX in AI, Mentorship Hour (Panel), ICML, Honolulu (2023)
- LatinX in AI, CV Research workshop, CVPR, New Orlean (2022)
- Industry panel, TTIC Summer Workshop on Foundations of Speech and Audio Foundation Models, Chicago (2025)
- Challenges and opportunities of federated learning for Audio Understanding, workshop "FLute: Federated Learning for Audio Understanding", ICASSP, Hyderabad (2025)
- Failure Modes in the Age of Foundation Models, workshop "I Can’t Believe It’s Not Better (ICBINB): Failure Modes in the Age of Foundation Models", NeurIPS, New Orleans (2023)
- Mentorship Hour, LatinX in AI, ICML, Honolulu (2023)
- On-Device Workshop MLSys, Miami (2023)
- 1st workshop and challenge on Vision-based InduStrial InspectiON, CVPR 2023
- 2st workshop on Vision-based InduStrial InspectiON, ECCV 2024
- Apple Workshop on Natural Language Understanding 2024
Kaggle Competition "Flavours of Physics"
- research/technical support
- award committee member
- co-organizer of ALEPH workshop at NeurIPS 2015
- starter-kit for competition
- Li-Wei Chen, summer internship, Apple 2024 (co-advising)
- Akshita Gupta, summer internship, Apple 2024 (co-advising with Navdeep Jaitly, Richard Bai, Karren Yang)
- Zijin Gu, AI/ML Residency, Apple 2023-2024 (co-advising with Navdeep Jaitly)
- Andrew Rouditchenko, summer internship, Apple 2023
- Lingxiao Zhao, summer internship, Apple 2023 (co-advising)
- Chun-wei Ho, summer internship, Apple 2023 (co-advising with Navdeep Jaitly and Ronan Collobert), Apple 2025 (co-advising with Masood Delfarah)
- Sheikh Shams Azam, AI/ML Resident, Apple 2022-2023 (co-advising with Honza Silovsky)
- Dan Berrebbi, summer internship, Apple 2022
- Mozhdeh Gheini, summer internship, Apple 2022 (co-advising with Matthias Sperber and Hendra Setiawan); Apple 2023 (co-advising)
- Colby Bunbary, summer internship, Apple 2022 (co-advising)
- Loren Lugosch: summer internship, Facebook AI Reserch 2021 (co-advising with Ronan Collobert and Gabriel Synnaeve); summer internship, Apple 2022 (co-advising with Ronan Collobert)
- Chaitanya Talnikar, AI Residency, Facebook AI Reserch 2019-2020 (co-advising with Ronan Collobert and Gabriel Synnaeve)
In News
- Interview to Republic (in Russian)
- Q&A with AI Residents
- About paper "Rethinking Evaluation in ASR: Are Our Models Robust Enough?"
- About kaggle challenge "Flavours of physics"
- About paper "LHCb Topological Trigger Reoptimization"
Honors & Awards
- 2025 Breakthrough Prize in Fundamental Physics (LHCb collaboration)
- Winner of Accelerate your code international competition, Intel (2012)
- Best student of Computer Science faculty, Lomonosov Moscow State University (2012)
- The winner (Regional stage) of All-Russian Programming contest (2007, 2008)