About me

Hi, my name is Alexander! I am a PhD student at Constructor University, Bremen. My research is mainly focused on Deep Learning methods in Natural Language Processing. For the last few years, I have been working on adaptation of diffusion models to discrete domains, such as text or code. I am trying to understand how the text-based latent space differs from the image latent space, and to find ways to minimize this difference. I am very interested in the progress of LLMs and I share what I learn by teaching the NLP course at HSE University and also by writing reviews here.

Since 2021, I have been working at the Bayesian Methods Research Group under the supervision of Dmitry Vetrov. I've received both BSc and MSc degrees from the Faculty of Computer Science at HSE University. During my undergrad, I worked at Yandex on the recommendation systems and at HSE lab on the word sense induction problem.

If you have questions about my work or you just want to chat, please feel free to reach me via email. I will be happy to answer any questions!

 

Publications

    Smoothie: Smoothing Diffusion on Token Embeddings for Text GenerationAlexander Shabalin, Viacheslav Meshchaninov, and Dmitry Vetrov. 2025, preprint.
    Compressed and Smooth Latent Space for Text Diffusion ModelingViacheslav Meshchaninov, Egor Chimbulatov, Alexander Shabalin, Aleksandr Abramov, and Dmitry Vetrov. 2025, NeurIPS.
    TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model EncodingsAlexander Shabalin, Viacheslav Meshchaninov, Egor Chimbulatov, Vladislav Lapikov, Roman Kim, Grigory Bartosh, Dmitry Molchanov, Sergey Markov, and Dmitry Vetrov. 2025, AAAI (oral).
    [Re] “Towards Understanding Grokking”Alexander Shabalin, Ildus Sadrtdinov, and Evgeniy Shabalin. 2023, MLRC (outstanding paper honorable mention).