I am joining the Computer Science Department at Johns Hopkins University as an Assistant Professor in Fall 2026. Until then, I am spending my gap year at AI2 (Fall 2025 – Fall 2026).
I will be recruiting Ph.D. students for Fall 2026. Please see this page for more details. I plan to attend ICCV 2025 (in Hawaii) and NeurIPS 2025 (in San Diego). Please feel free to reach out if you would like to chat in person!
My research focuses on multimodal AI, integrating diverse data types (e.g., images, videos, text, audio, and motion) to develop models that are interpretable, controllable, and scalable. My recent interests include learning action knowledge (e.g., robot actions, human actions, physical laws) from unlabeled videos, learning to reason in non-textual chains of thought at scale, and developing AI-based software that enhances human productivity in various applications (e.g., film, design, music, choreography, medicine).
Below is a summary of my research during my Ph.D. years.
(1) Scalable Multimodal Frameworks – Modern AI models must meet the growing demand for thousands of capabilities. My research has addressed this challenge by introducing: (a) Unified generative frameworks that flexibly accommodate diverse modalities and tasks, using a single architecture and a generative objective – VL-T5 (ICML 2021) / X-LXMERT (EMNLP 2020) / TVLT (NeurIPS 2022 Oral) and (b) Efficient finetuning frameworks that significantly reduce parameter and memory requirements for creating task-specific models – VL-Adapter (CVPR 2022) / LST (NeurIPS 2022) / Ctrl-Adapter (ICLR 2025 Oral)
(2) Faithful Multimodal Reasoning – Scaling alone is not enough. Large models that rely on black-box reasoning and encode all knowledge within their parameters often struggle with basic tasks and produce hallucinations. My research makes their reasoning process more accurate and interpretable by introducing: (a) Planning-based frameworks that decompose complex visual generation problems into faithful, human-interpretable step-by-step reasoning processes – VPGen (NeurIPS 2023) / VideoDirectorGPT (COLM 2024) / DiagrammerGPT (COLM 2024) / Video-MSG (2024) and (b) Retrieval-augmented generation (RAG) frameworks that enhance accuracy and factuality by retrieving relevant information before generating outputs – M3DocRAG (2024) / HiREST (CVPR 2023)
(3) Evaluation and Refinement of Multimodal Generation – With recent advancements in multimodal generation models, conventional evaluation metrics have often become saturated and no longer provide meaningful insights into future research directions. To this end, my research introduces: (a) Fine-grained evaluation frameworks that comprehensively measure model skills in multiple dimensions to uncover detailed strengths and weaknesses – DALL-Eval (ICCV 2023) / VPEval (NeurIPS 2023) / DSG (ICLR 2024) / LayoutBench (CVPRW 2024 Oral) / FineCapEval (Findings of NAACL 2022) / M3DocVQA (2024) / CAPTURe (ICCV 2025) and (b) Automatic model refinement frameworks that use these evaluations to detect models’ weaknesses and refine their reasoning process – EnvGen (COLM 2024) / DataEnvGym (ICLR 2025 Spotlight) / SELMA (NeurIPS 2024) / VideoRepair (2024)
I’m also sharing some of my past application materials below. I know these are far from perfect, but I hope this helps you in your applications!
- Academic Job Market (written in Dec 2024): Research Statement, Teaching Statement, DEI Statement
- Bloomberg Data Science Fellowship (written in Apr 2023): Research Statement
- PhD Admission (written in Dec 2019): Statement of Purpose
- This collection of SOPs for CS PhD programs looks useful as well!
News
Sep 2025 - 1 paper accepted at NeurIPS 2025.
Sep 2025 - I’m starting my gap year on the AI2 PRIOR team as a Young Investigator!
Aug 2025 - 1 paper accepted at Findings of EMNLP 2025.
Aug 2025 - New preprints.
Jul 2025 - New preprints.
Jun 2025 - 1 paper accepted at ICCV 2025.
Jun 2025 - New preprints.
May 2025 - New preprints.
Apr 2025 - New preprints:
- CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
- Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
- Video-MSG: Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
Feb 2025 - 2 papers accepted at ICLR 2025:
- Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
- as Oral presentation (top 1.8% of 10,000+ submissions)
- DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
- as Spotlight presentation (top 5.1% of 10,000+ submissions)
Ph.D. in Computer Science, 2025
University of North Carolina at Chapel Hill
B.S. in Industrial Engineering, 2018
Seoul National University
Publications
Han Lin, Jaemin Cho, Amir Zadeh, Chuan Li, Mohit Bansal
In NeurIPS, 2025
Tianyi Niu, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal
arXiv preprint, 2025
Daeun Lee*, Jaehong Yoon*, Jaemin Cho, Mohit Bansal
In Findings of EMNLP, 2025
Mohamed Elmoghany, Ryan Rossi, Seunghyun Yoon, Subhojyoti Mukherjee, Eslam Bakr, Puneet Mathur, Gang Wu, Viet Dac Lai, Nedim Lipka, Ruiyi Zhang, Varun Manjunatha, Chien Nguyen, Daksh Dangi, Abel Salinas, Mohammad Taesiri, Hongjie Chen, Xiaolei Huang, Joe Barrow, Nesreen Ahmed, Hoda Eldardiry, Namyong Park, Yu Wang, Jaemin Cho, Anh Totti Nguyen, Zhengzhong Tu, Thien Nguyen, Dinesh Manocha, Mohamed Elhoseiny, Franck Dernoncourt
arXiv preprint, 2025
Atin Pothiraj, Elias Stengel-Eskin, Jaemin Cho, Mohit Bansal
In ICCV, 2025
Zaid Khan, Elias Stengel-Eskin, Archiki Prasad, Jaemin Cho, Mohit Bansal
arXiv preprint, 2025
Jialu Li*, Shoubin Yu*, Han Lin*, Jaemin Cho, Jaehong Yoon, Mohit Bansal
arXiv preprint, 2025
Han Lin*, Jaemin Cho*, Abhay Zala, Mohit Bansal
In ICLR (Oral), 2025
Zaid Khan, Elias Stengel-Eskin, Jaemin Cho, Mohit Bansal
In ICLR (Spotlight), 2025
Daeun Lee, Jaehong Yoon, Jaemin Cho, Mohit Bansal
arXiv preprint, 2024
Jaemin Cho, Debanjan Mahata, Ozan İrsoy, Yujie He, Mohit Bansal
In ICCV Workshop, 2024
Jialu Li*, Jaemin Cho*, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal
In NeurIPS, 2024
Abhay Zala, Han Lin, Jaemin Cho, Mohit Bansal
In COLM, 2024
Abhay Zala*, Jaemin Cho*, Han Lin, Jaehong Yoon, Mohit Bansal
In COLM, 2024
Han Lin, Abhay Zala, Jaemin Cho, Mohit Bansal
In COLM, 2024
Yasumasa Onoe, Sunayana Rane, Zachary Berger, Yonatan Bitton, Jaemin Cho, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason Baldridge
In ECCV, 2024
David Wan, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal
In ECCV, 2024
Heesoo Jang, Jaemin Cho
In ICA (Top Paper Award), 2024
Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal
In CVPR Workshop (Oral), 2024
Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer
In CVPR, 2024
Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang
In ICLR, 2024
Jaemin Cho, Abhay Zala, Mohit Bansal
In NeurIPS, 2023
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal
In NeurIPS, 2023
Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji
In NeurIPS, 2023
Jaemin Cho, Abhay Zala, Mohit Bansal
In ICCV, 2023
Abhay Zala*, Jaemin Cho*, Satwik Kottur, Xilun Chen, Barlas Oğuz, Yashar Mehdad, Mohit Bansal
In CVPR, 2023
Zineng Tang*, Jaemin Cho*, Jie Lei, Mohit Bansal
In WACV, 2023
Zineng Tang*, Jaemin Cho*, Yixin Nie*, Mohit Bansal
In NeurIPS (Oral), 2022
Yi-Lin Sung, Jaemin Cho, Mohit Bansal
In NeurIPS, 2022
Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
In Findings of NAACL, 2022
Yi-Lin Sung, Jaemin Cho, Mohit Bansal
In CVPR, 2022
Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avi Sil, Shih-Fu Chang, Alexander Schwing, Heng Ji
In AAAI, 2021
Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal
In NeurIPS, 2021
Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal
In ICML, 2021
Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi
In EMNLP, 2020
Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi
In EMNLP, 2019