HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 13 Aug 2025 02:44:18 GMT
access-control-allow-origin: *
etag: W/"689bfc02-1612"
expires: Mon, 29 Dec 2025 19:53:10 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: CDC0:272D88:94C19A:A6C434:6952D9C9
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 19:43:10 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210030-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767037391.545587,VS0,VE202
vary: Accept-Encoding
x-fastly-request-id: c10971b82248aa66a5b7c10a50b449dd79cec68b
content-length: 2069
Vladimir Malinovskii
Vladimir Malinovskii
I am an ML Resident at Yandex Research , specializing in quantization algorithms. I am pursuing a Master's degree in Computer Science at the Higher School of Economics (HSE) , through a joint program with the Yandex School of Data Analysis .
Previously, I was a Software Engineer in the Infrastructure team at Yandex , contributing to the development and maintenance of high-scale deployment systems.
I hold a Bachelor of Science in Applied Mathematics and Physics with a minor in Data Analysis from the Moscow Institute of Physics and Technology (MIPT) .
Email /
CV /
Scholar /
Github /
Linkedin
PV‑Tuning: Beyond Straight‑Through Estimation for Extreme LLM Compression
Vladimir Malinovskii* , Denis Mazur*, Ivan Ilin*, Denis Kuznedelev, Konstantin Burlachenko, Kai Yi, Dan Alistarh, and Peter Richtarik
NIPS, 2024, Oral | Arxiv | Code
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Vladimir Malinovskii , Andrei Panferov, Ivan Ilin, Han Guo, Peter Richtárik, Dan Alistarh
NAACL, 2025 | Arxiv
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models
Alina Shutova, Vladimir Malinovskii , Vage Egiazarian, Denis Kuznedelev, Denis Mazur, Nikita Surkov, Ivan Ermakov, Dan Alistarh
Arxiv, 2025 | Arxiv