
Selected publications

    Multi-modal Models

  • Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

    Christopher Clark*♥, Jieyu Zhang*♥, Zixian Ma*♥, Jae Sung Park♥, Mohammadreza Salehi♥, Rohun Tripathi♥, Sangho Lee♥, Jason Ren, Chris Dongjoo Kim, Yinuo Yang, Vincent Shao, Yue Yang, Weikai Huang, Ziqi Gao, Taira Anderson, Jianrui Zhang, Jitesh Jain, George Stoica, Winston Han, Ali Farhadi, Ranjay Krishna.

    [Blog] [Report] [Models] [Data] [Playground]

  • LATTE: Learning to Think with Vision Specialists

    Zixian Ma, Jianguo Zhang, Zhiwei Liu, Jieyu Zhang, Juntao Tan, Manli Shu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Caiming Xiong, Ranjay Krishna, Silvio Savarese

    EMNLP 2025 Oral Presentation | SynData4CV workshop @ CVPR 2025

    [Website] [PDF] [Code]

  • Synthetic Visual Genome

    Jae Sung Park, Zixian Ma, Linjie Li, Chenhao Zheng, Cheng-Yu Hsieh, Ximing Lu, Khyathi Chandu, Quan Kong, Norimasa Kobori, Ali Farhadi, Yejin Choi, Ranjay Krishna

    CVPR 2025

    [Website] [PDF] [Code]

  • Task Me Anything

    Jieyu Zhang, Weikai Huang*, Zixian Ma*, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna.

    NeurIPS 2024 (Datasets & Benchmarks Track) | Video-Language Models @ NeurIPS 2024 Oral Presentation

    [Website] [PDF] [Code]

  • m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

    Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna

    ECCV 2024 | SynData4CV workshop @ CVPR 2024

    [Website] [PDF] [Code]

  • SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality

    Cheng-Yu Hsieh*, Jieyu Zhang*, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna

    NeurIPS 2023 (Datasets & Benchmarks Track)

    [PDF] [Code]

  • CREPE: Can Vision-Language Foundation Models Reason Compositionally?

    Zixian Ma*, Jerry Hong*, Mustafa Omer Gul*, Mona Gandhi, Irena Gao, Ranjay Krishna

    CVPR 2023 Highlight [top 2.5%]

    [PDF] [Code]

    Human-AI Interaction

  • Rethinking Human Preference Evaluation of LLM Rationales

    Ziang Li*, Manasi Ganti*, Zixian Ma*, Helena Vasconcelos, Qijia He, Ranjay Krishna

    XLLM-Reason-Plan workshop @ COLM 2025 Best Paper Award (Honorable Mention)

    [PDF]

  • Completion ≠ Collaboration: Scaling Collaborative Effort with Agents

    Shannon Zejiang Shen, Valerie Chen, Ken Gu, Alexis Ross, Zixian Ma, Alex Gu, Chenglei Si, Jillian Ross, Jocelyn J Shen, Wayne Chi, Andi Peng, Ameet Talwalkar, Tongshuang Wu, David Sontag

    ResponsibleFM (Foundation Models) workshop @ NeurIPS 2025 Best Paper Award

    [Website] [PDF] [Code]

  • Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

    Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein

    CHI 2023

    [PDF]

  • ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward

    Zixian Ma, Rose Wang, Li Fei-Fei, Michael Bernstein, Ranjay Krishna

    NeurIPS 2022

    [PDF] [Code]