HOME
ABOUT
- RESULTS
- differences
- BENEFITS
- HISTORY
- TEAM
- LOCATION
- FACILITIES
- BANKING
- MEMBERSHIPS
- APPROVALS
- LICENCES
- SUPPLIERS
- SPONSORSHIPS
- MEDIA
- PRIVACY
AUCTIONS
SHIPPING
FEES
- TS REWARDS
TOOLS
guides
FAQ
CONTACT
- CONNECT

VEHICLES
BRAND
- JAPANESE CARS
  - DAIHATSU
  - EUNOS
  - FORD
  - HONDA
  - ISUZU
  - LEXUS
  - MAZDA
  - MITSUBISHI
  - MITSUOKA
  - NISSAN
  - SUBARU
  - SUZUKI
  - TOYOTA
- GERMAN CARS
- AMERICAN CARS
- BRITISH CARS
- ITALIAN CARS
- FRENCH CARS
- SWEDISH CARS
- KOREAN CARS
TYPE
- mobility
- VENDING
- instruction
- TAXIS
- AMBULANCES
- FIRE ENGINES
- HEARSES
- LIMOUSINES
- COMMERCIAL
CLASS
FUEL
TRUCKS
minitrucks
- DAIHATSU
- HONDA
- MAZDA
- MITSUBISHI
- NISSAN
- SUBARU
- SUZUKI
- DUMP
- CRANE
- CAMPER
- REFRIGERATED
- 4WD
- NEW
BUSES
MOTORHOMES
- YAHOO!
- RAKUTEN
- DEALER

PARTS
- FREE REPORT
- PARTS CONTAINERS
- PARTS SYSTEMS
- PARTS PROTECTION
- BODY SHELLS
- DISMANTLING
- ONLINE PARTS
- NEW PARTS
- INTERIOR PARTS
- EXTERIOR PARTS
  - BONNETS
  - BUMPERS
  - GRILLES
  - FENDERS
  - DOORS
  - TRUNKS
  - SPOILERS
  - LIGHTS
  - EMBLEMS
  - CAMERAS
- ENGINES
- TRANSMISSIONS
- WHEELS & TYRES
  - WHEELS
  - TYRES
CUTS
PERFORMANCE PARTS
TRUCK PARTS
MOTORBIKE PARTS
- MOTORBIKE ENGINES
- MOTORBIKE ACCESSORIES

MOTORBIKES
MARINE
FORKLIFTS
MACHINERY
AGRICULTURAL
OTHER
COUNTRY
- AUSTRALIA
- CANADA
- KENYA
- MYANMAR
- NEW ZEALAND
- PAKISTAN
- TANZANIA
- UNITED STATES

CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 server: GitHub.com content-type: text/html; charset=utf-8 last-modified: Fri, 21 Nov 2025 22:43:12 GMT access-control-allow-origin: * strict-transport-security: max-age=31556952 etag: W/"6920eb00-150d1" expires: Mon, 29 Dec 2025 22:02:19 GMT cache-control: max-age=600 content-encoding: gzip x-proxy-cache: MISS x-github-request-id: 73F2:2C10E1:95DFB9:A820CD:6952F811 accept-ranges: bytes age: 0 date: Mon, 29 Dec 2025 21:52:19 GMT via: 1.1 varnish x-served-by: cache-bom-vanm7210045-BOM x-cache: MISS x-cache-hits: 0 x-timer: S1767045139.047691,VS0,VE210 vary: Accept-Encoding x-fastly-request-id: 94b1e91aea345cf7e07ee338127b579f8fe6f5ed content-length: 15799 Ziniu Hu's Website

Ziniu Hu's Website

About
Publications
Service

Ziniu Hu

About

I was working at xAI, reinforcing Large Language Models (e.g. Grok Code Fast 1, Grok 3-mini Reasoning, Grok 3 and Grok 2) to solve real-world problems.

I finished Postdoc at Caltech CMS hosted by Prof. Yisong Yue, during which I was also a visiting researcher at Google DeepMind. I received CS PhD degree at UCLA, where I had the fortune to be advised by Prof. Yizhou Sun and Prof. Kai-Wei Chang. I received my CS bachelor degree at Peking University, advised by Prof. Xuanzhe Liu. My research is generously supported by Amazon PhD Fellowship and Baidu Scholarship. My PhD thesis on Neural-Symbolic AI won the ACM KDD 2024 Dissertation Award - Runner Up .

Education

Ph.D. of Computer Science
Sept. 2018 -- May 2023

University of Calofornia, Los Angeles

Thesis: Make Knowledge Computable: Towards Differentiable Neural-Symbolic AI
B.Sc. of Computer Science
Sept. 2014 -- Jun. 2018

Peking University

Academic Awards

ACM SIGKDD 2024 Dissertation Award - Runner Up
Best Paper Award, NeurIPS 2023 Workshop (DL + Differential Equation)
Best Paper Award, SoCal NLP Symposium 2022
Best Student Paper Award, KDD 2020 Workshop (DL on Graphs)
Best Full Paper Award, WWW 2019

Services

Research Track Workflow Co-Chair: SIGKDD 2023
Area Chair of ICLR, NeurIPS, ACL 2025, ICLR, ICML 2026
NeurIPS 2022 Top Reviewer Award
Workshop Co-Organizer of Tool-VLM @ CVPR'24 and SSL @ WWW'21

Fellowships

Computing, Data, and Society Fellow at Caltech, 2023
Baidu Scholarship (10 PhD students worldwide), 2021
Amazon PhD Fellowship, 2021-2022
SenseTime Scholarship, 2018
May 4th Scholarship of Peking University, 2016

Selected Publications

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Zhenyu Hou*, Ziniu Hu*, Yujiang Li*, Rui Lu*, Jie Tang, Yuxiao Dong PDF CODE

Conference of the Association for Computational Linguistics (ACL 2025)

We propose TreeRL, a reinforcement learning framework that directly incorporates on-policy tree search for LLM RL training, as well as a cost-effective tree search approach that strategically branch from high-entropy tokens.

Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search

Jonathan Light, Min Cai, Weiqin Chen, Guanzhi Wang, Xiusi Chen, Wei Cheng, Yisong Yue, Ziniu Hu PDF CODE DEMO

International Conference on Learning Representations (ICLR 2025)

Covered by State of AI Report 2024 , published by Air Street Capital.

We propose Strategist, a method allowing LLMs to learn new skills for multi-agent games. With bi-level tree search approach, combining high-level strategic learning with low-level simulated self-play for feedback. It outperformed RL and other LLM-based approaches on Game of Pure Strategy and The Resistance: Avalon at action planning and dialogue generation.

Multi-Token Joint Speculative Decoding for Accelerating Large Language Model Inference

Zongyue Qin, Ziniu Hu, Zifan He, Neha Prakriya, Jason Cong, Yizhou Sun PDF CODE

International Conference on Learning Representations (ICLR 2025)

We propose a novel decoding that improves perplexity and downstream performance with 1.4 times faster and 1.5 times less energy cost compared to speculative decoding by considering joint probability of multiple tokens.

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Zongyu Lin, Yao Tang, Xingcheng Yao, Da Yin, Ziniu Hu, Yizhou Sun, Kai-Wei Chang PDF

International Conference on Machine Learning (ICML 2025)

QLASS (Q-guided Language Agent Stepwise Search), is a framework that supercharges language agents at inference time. We build a process reward model to guide open language agents on complex interactive tasks by estimating the Q-value of each step without any human annotation.

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Dan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2024)

In this paper, we develop a reinforced self-training approach, called ReST-MCTS*, based on integrating process reward guidance with tree search MCTS* for collecting higher-quality reasoning traces as well as per-step value to train policy and reward models. ReST-MCTS* circumvents the per-step manual annotation typically used to train process rewards by tree-search-based reinforcement learning: Given oracle final correct answers, ReST-MCTS* is able to infer the correct process rewards by estimating the probability this step can help lead to the correct answer. These inferred rewards serve dual purposes: they act as value targets for further refining the process reward model and also facilitate the selection of high-quality traces for policy model self-training.

SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models

Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2024, Dataset Track)

We use LLM to self-curated SciInstruct, a diverse and high-quality dataset of college-level mathematics, physics, chemistry, and formal proofs. Using SciInstruct to finetune the ChatGLM family of LLMs, we introduce SciGLM, a suite of scientific language models for college-level mathematical/scientific reasoning.

Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling

Zijie Huang*, Wanjia Zhao*, Jingdong Gao, Ziniu Hu, Xiao Luo, Yadi Cao, Yuanzhou Chen, Yizhou Sun, Wei Wang PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2024), Best Paper Award at NeurIPS 2023, Deep Learning and Differential Equations (DLDE) workshop

We propose a physical-law-guided regularization term corresponding to a soft constraint of time-reversal symmetry. The term is applied to GraphODE models for multi-agent dynamical systems and demonstrated as superior to several baselines on a variety of benchmarks, including the challenging pendulum problem.

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, James Zou, Kai-Wei Chang, Wei Wang PDF CODE WEBSITE

Conference on Neural Information Processing Systems (NeurIPS 2024)

We introduce Self-Training on Image Comprehension (STIC), which self-constructs a preference dataset for image descriptions using unlabeled images. Preferred responses are generated through a step-by-step prompt, while dis-preferred responses are generated from either corrupted images or misleading prompts.

Can Large Language Model Agents Simulate Human Trust Behavior?

Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Shiyang Lai, Kai Shu, Jindong Gu, Adel Bibi, Ziniu Hu, David Jurgens, James Evans, Philip Torr, Bernard Ghanem, Guohao Li PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2024)

Under the framework of Trust Games, we discover that LLM agents can have high behavioral alignment with humans regarding trust behaviors, indicating the feasibility to simulate human trust behaviors with LLM agents

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi PDF

International Conference on Machine Learning (ICML 2024, Oral Presentation)

We introduces SceneCraft, an LLM Agent converting text descriptions into Blender-executable Python scripts which render complex scenes with up to a hundred 3D assets. SceneCraft can keep self-improving via Library Learning.

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue PDF CODE DEMO

International Conference on Machine Learning (ICML 2024, Oral Presentation)

We study the problem of symbolic music generation, with a technical focus on non-differentiable rule guidance by Musical Rules (e.g., note density or chord progression). We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time.

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

Xiaoxuan Wang*, Ziniu Hu*, Pan Lu*, Yanqiao Zhu*, Jieyu Zhang, Satyen Subramaniam, Arjun R Loomba, Shichang Zhang, Yizhou Sun, Wei Wang PDF CODE & Dataset

International Conference on Machine Learning (ICML 2024)

Covered by Nature News Feature (15 November 2023)

We propose SciBench to systematically examine LLM's reasoning for complex scientific problem solving. SCIBENCH contains two carefully curated datasets: an open set featuring a range of collegiate-level scientific problems drawn from mathematics, chemistry, and physics textbooks, and a closed set comprising problems from undergraduate-level exams in computer science and mathematics.

AVIS: Autonomous Visual Information Seeking with Large Language Model Agent

Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David A. Ross, Cordelia Schmid, Alireza Fathi PDF Google AI Blog-Post

Conference on Neural Information Processing Systems (NeurIPS 2023)

we propose an autonomous information seeking visual question answering framework, AVIS. Our method leverages a Large Language Model (LLM) to dynamically strategize the utilization of external tools and to investigate their outputs, thereby acquiring the indispensable knowledge needed to provide answers to the posed questions.

Learning to Group Auxiliary Datasets for Molecule

Tinglin Huang, Ziniu Hu, Rex Ying PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2023)

We propose MolGroup to address the limited data problem in molecule property prediction by leveraging auxiliary datasets to improve performance on target datasets, via a routing mechanism w/ bi-level optimization.

Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis

Yunsheng Bai, Atefeh Sohrabizadeh, Zongyue Qin, Ziniu Hu, Yizhou Sun, Jason Cong PDF CODE & Dataset

Conference on Neural Information Processing Systems (NeurIPS 2023, Dataset Track)

High-level synthesis (HLS) aims to raise the abstraction layer in hardware design, enabling the design of domain-specific accelerators (DSAs) like FPGAs using C/C++ instead of hardware description languages. To enable machine learning models to predict design quality, we present HLSYN, a comprehensive dataset for training and evaluating design quality prediction models for hardware design.

AvalonBench: Evaluating LLMs Playing the Game of Avalon

Jonathan Light*, Min Cai*, Sheng Shen, Ziniu Hu PDF Game CODE DEMO

NeurIPS 2023, Foundation Models for Decision Making (FMDM) workshop

we introduce AvalonBench - a comprehensive game environment tailored for evaluating multi-agent LLM Agents. This benchmark incorporates: (1) a game environment for Avalon, (2) rule-based bots as baseline opponents, and (3) ReAct-style LLM agents with tailored prompts for each role.

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross and Alireza Fathi PDF Google AI Blog-Post PROJECT CODE (JAX/Scenic)

Conference on Computer Vision and Pattern Recognition (CVPR 2023), selected as Highlight.

We propose an end-to-end Retrieval-Augmented Visual Language Model (REVEAL) that learns to encode world knowledge into a large-scale memory, and to retrieve from it to answer knowledge-intensive queries. The key novelty is that the memory, retriever and generator are all pre-trained end-to-end to use a diverse set of multimodal knowledge sources, bringing significant gains.

Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering

Ziniu Hu, Yichong Xu, Wenhao Yu, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Kai-Wei Chang and Yizhou Sun PDF

Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

Best Paper Award at SoCal NLP Symposium 2022

We propose a novel symbolic Knowledge Graph (KG) reasoning layer that could be flexibly plugged into most existing Language Models (LMs) and allow LMs to interact with KG, unifying the retrieval and reasoning in a end-to-end framework. OREO-LM improves RoBERTa and T5 on various QA tasks, and the generated reasoning paths could help interpret the model's decision.

Improving Multi-Task Generalization via Regularizing Spurious Correlation

Ziniu Hu, Zhe Zhao, Xinyang Yi, Tiansheng Yao, Lichan Hong, Yizhou Sun, Ed H. Chi PDF

Conference on Neural Information Processing Systems (NeurIPS 2022, Spotlight Presentation)

We point out the unique challenges of spurious correlation problem in multi-task setting that influence generalization. We propose Multi-Task Causal Representation Learning (MT-CRL) framework to learn 1) disentangled neural modules; 2) Task-to-Module Causal Graph; 3) Regularize spurious correlation over learned causal graph.

Zero-shot Transfer Learning within a Heterogeneous Graph via Knowledge Transfer Networks

Minji Yoon, John Palowitch, Dustin Zelle, Ziniu Hu, Russ Salakhutdinov, Bryan Perozzi PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2022)

We propose a zero-shot transfer learning module for heterogeneous graph neural networks that transfers knowledge from label-abundant node types to zero-labeled node types through rich relational information given in a single heterogeneous graph.

Fuzzy Logic based Logical Query Answering on Knowledge Graph

Xuelu Chen, Ziniu Hu, Yizhou Sun PDF CODE

AAAI Conference on Artificial Intelligence (AAAI 2022, Oral Presentation)

We propose FuzzQE, a fuzzy logic based logical query embedding framework for answering FOL queries over KGs. FuzzQE define logical operators in a principled and learningfree manner, which could be trained with only KG without any complex queries.

Relation-Guided Pre-Training for Open-Domain Question Answering

Ziniu Hu, Kai-Wei Chang, Yizhou Sun PDF

Conference on Empirical Methods in Natural Language Processing (EMNLP-Finding 2021)

We propose RGPT-QA to synthesize QA pairs from relation triplets in WikiData and WikiPedia for pre-training Open-Domain QA Model and improves the QA performance, especially for questions with long-tail relations.

Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning

Da Yin, Liunian Li, Ziniu Hu, Nanyun Peng, Kai-Wei Chang PDF CODE Dataset

Conference on Empirical Methods in Natural Language Processing (EMNLP 2021, Oral Presentation)

we construct a Geo-Diverse Visual Commonsense Reasoning dataset (GD-VCR) to test Vision-Language models' ability to understand cultural and geo-location-specific commonsense. We find that the performance of SOTA VL models for non-Western regions (e.g., East Asia, South Asia, and Africa) is significantly lower than that for Western region.

GPT-GNN: Generative Pre-Training of Graph Neural Networks

Ziniu Hu, Yuxiao Dong, Kuansan Wang, Kai-Wei Chang, Yizhou Sun PDF CODE SLIDES

Conference on Knowledge Discovery and Data Mining (KDD 2020, Oral, Top-10 Cited Paper in KDD'20)

We introduce a self-supervised graph generation task to pre-train GNN. We factorize the likelihood of graph generation into two components: 1) attribute generation, and 2) edge generation, without lossing mutual dependency.

Heterogeneous Graph Transformer

Ziniu Hu, Yuxiao Dong, Kuansan Wang, Yizhou Sun PDF CODE (My PyG Impl.) CODE (DGL) CODE (PyG Re-Impl.) SLIDES

The Web Conference (WWW 2020, Most Cited Paper in WWW'20)

We present the Heterogeneous Graph Transformer (HGT) architecture for modeling Web-scale heterogeneous (nodes and edges have multiple types) and dynamic graphs. HGT could automatically learns important meta-paths for different downstream tasks.

Improving Neural Language Generation with Spectrum Control

Lingxiao Wang, Jing Huang, Kevin Huang, Ziniu Hu, Guangtao Wang, Quanquan Gu PDF SLIDES

The International Conference on Learning Representations (ICLR 2020)

We propose a novel spectrum control approach to address this degeneration problem. The core idea of our method is to directly guide the spectra training of the output embedding matrix with a slow-decaying singular value prior distribution through a reparameterization framework.

Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks

Difan Zou*, Ziniu Hu*, Yewen Wang, Song Jiang, Yizhou Sun, Quanquan Gu PDF CODE

Conference on Neural Information Processing Systems (NeurIPS 2019)

We propose LAyer-Dependent ImportancE Sampling (LADIES). Based on the sampled nodes in the upper layer, LADIES selects their neighborhood nodes, compute the importance probability accordingly and samples a fixed number of nodes within them.

Few-Shot Representation Learning for Out-Of-Vocabulary Words

Ziniu Hu , Ting Chen, Kai-Wei Chang, Yizhou Sun PDF CODE

Conference of the Association for Computational Linguistics (ACL 2019)

We formulate the learning of OOV embedding as a few-shot regression problem by predicting an oracle embedding vector (defined as embedding trained with abundant observations) based on only K contexts. Specifically, we use Model-Agnostic Meta-Learning (MAML) for adapting a hierachical Transformer to the new corpus fast and robustly.

Unbiased LambdaMART: An Unbiased Pairwise Learning-to-Rank Algorithm

Ziniu Hu , Yang Wang, Qu Peng, Hang Li PDF CODE

The Web Conference (WWW 2019)

We propose a novel framework for pairwise learning-to-rank. Our algorithm, Unbiased LambdaMART can jointly estimate the biases at click positions and the biases at unclick positions, and learn an unbiased ranker.

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

Zhenpeng Chen*, Sheng Shen*, Ziniu Hu , Xuan Lu, Qiaozhu Mei, Xuanzhe Liu PDF CODE

The Web Conference (WWW 2019, Best Full Paper Award)

We employ emoji prediction task as the instrument to learn both the cross-language and language-specific sentiment patterns in different languages.

Listening to Chaotic Whispers: A Deep Learning Framework for News-oriented Stock Trend Prediction

Ziniu Hu , Weiqing Liu, Jiang Bian, Xuanzhe Liu, Tie-Yan Liu PDF

Conference on Web Search and Data Mining (WSDM 2018).

We designed a Hybrid Attention Networkss(HAN) to predict the stock trend based on the sequence of recent related news, with self-paced learning mechanism to guide efficient learning.

Teaching Experience

Lecturer for UCLA CS 145: Introduction to Data Mining, 2024 Spring.
Teaching Assistant for UCLA CS 249: Graph Neural Networks, 2021 Winter.
Teaching Assistant for UCLA CS 146: Introduction to Machine Learning, 2019 Fall.

Academic Services

Area Chair of WWW 2024,2025, NeurIPS 2025, ICLR 2025, ACL 2025
Journal Associate Editor of Transaction on Big Data (TBD)
Research Track Workflow Co-Chair: SIGKDD 2023 (ACM Conference on Knowledge Discovery and Data Mining)
Program Committee / Reviewer: Neurips (Top Reviewer Award @ 2022), ICML, ICLR, KDD, ACL (+Rolling Review), EMNLP, AAAI, IJCAI, WWW, CIKM (Annual reviewer since 2019, reviewed for 100+ conference papers)
Journal Reviewer: TKDE, TKDD, TOIS, TPAMI, TCS, TBD, JAIR (Reviewed for 20+ Journal papers)
Program Committee Co-Chair: SSL @ WWW 2021 (Workshop on Self-Supervised Learning for the Web)
Senior PC & Meta-Reviewer: KnowledgeNLP @ AAAI 2023 (Workshop on Knowledge Augmented Methods for NLP)
Reading Group Organizer @ UCLA-DM Lab from 2018 to 2022.

Invited Talks

Enhancing LLM Reasoning via Reinforcement Learning and Tree Search
- Keynote at KDD 2024 Generative AI Day (recording link)
- Contributed Talk at INFORMS APS 2025
Make Knowledge Computable: Differentiable Neural-Symbolic Reasoning
- USC AI Seminars at USC Information Sciences Institute
- ByteDance AI Lab, AI Seminar
Self-Supervised Learning and Logical Reasoning over Knowledge Graphs
- DataFunTalk, Graph Learning Seminar
- AI Time, Tsinghua University

HOME
ABOUT
AUCTIONS
SHIPPING
FEES
TOOLS
HOW
FAQ
CONTACT

Original Source | Taken Source