Biography
I am a Research Scientist at Google Research in Tel-Aviv where I work on multimodal consistency.
My research is centered on improving large vision-and-language models. I develop feedback models for text-to-image and text-to-video applications, specifically designed to enhance the alignment of visual outputs with their corresponding textual prompts. Additionally, I work on multimodal factuality, including visual understanding and image or video-to-text evaluation, ensuring that the generated text is factually correct and attributable to trustworthy textual or visual sources.
I completed my PhD at The Hebrew University of Jerusalem, Israel, where I had the privilege of being advised by Dr. Roy Schwartz and Dr. Gabriel Stanovsky. A recording of my PhD talk, "Bridging Vision and Language with Data: From Perception to Understanding", is available here. I did my MSc with Prof. Michael Elhadad and Prof. Eitan Bachmat at Ben Gurion University.
Download my complete CV: link. Download my bio: link.
- PhD in Computer Science (Vision-and-Language), 2020-2023
  The Hebrew University of Jerusalem, Israel
- MSc in Computer Science (Natural Language Processing), Magna cum laude, 2018-2019
  Ben Gurion University of the Negev, Israel
- BSc in Computer Science, 2015-2018
  Ben Gurion University of the Negev, Israel
Students
I've had the opportunity to collaborate with several MSc and PhD students towards their publication goals:
2. Wenbo (Gordon) Hu (University of California, Los Angeles) 1
3DLLM-Mem
4. Aviv Slobodkin (Bar-Ilan University) 1
RefVNLI
5. Moran Yanuka (Tel-Aviv University) 1
Bridging the Visual Gap
6. Mor Ventura (Technion – Israel Institute of Technology) 1
NL-Eye
7. Orr Zohar (Stanford University) 1
Video-STaR
11. Oren Sultan (The Hebrew University of Jerusalem) 1
ParallelPARC
12. Netta Madvil (The Hebrew University of Jerusalem) 1
Read, Look or Listen?
If you'd like to work together on vision-and-language research, send me an email.
Papers by Venue
24 peer-reviewed papers · 2021–2025
2022 1
NeurIPS 1
WinoGAViL
Publications
Selected as Featured Presentation
Work Experience
Developed a virtual fitness trainer, specializing in 2D/3D pose estimation, action recognition, error correction, on-device deployment and more.
Invited Talks
A recording of my talk is available here.
Others
This project took part in the Starter - Jump course and won 1st place at the final Demo Day event.
Press coverage: telecomnews, israeldefense, sheva7.















