NL-EYE: ABDUCTIVE NLI FOR IMAGES
Google Research
Abstract
Will a Visual Language Model (VLM)-based bot warn us about slipping if it detects a wet floor? Recent VLMs have demonstrated impressive capabilities, yet their ability to infer outcomes and causes remains underexplored. To address this, we introduce NL-Eye, a benchmark designed to assess VLMs' visual abductive reasoning skills. NL-Eye adapts the abductive Natural Language Inference (NLI) task to the visual domain, requiring models to evaluate the plausibility of hypothesis images based on a premise image and to explain their decisions. NL-Eye consists of 350 carefully curated triplet examples (1,050 images) spanning diverse reasoning categories: physical, functional, logical, emotional, cultural, and social. The data curation process involved two steps: writing textual descriptions and generating images using text-to-image models, both requiring substantial human involvement to ensure high-quality, challenging scenes. Our experiments show that VLMs struggle significantly on NL-Eye, often performing at random-baseline levels, while humans excel in both plausibility prediction and explanation quality. This demonstrates a deficiency in the abductive reasoning capabilities of modern VLMs. NL-Eye represents a crucial step toward developing VLMs capable of robust multimodal reasoning for real-world applications, including accident-prevention bots and generated-video verification.
Motivation: Will a VLM-based bot warn us about slipping if it detects a wet floor?
NL-Eye data curation workflow scheme.
Models and baselines by their input strategy and reasoning approach.
Two input setups for the VLM: (1) the Triplet setup (left) and (2) the Pairs setup (right). In the triplet setup, the triplet is presented twice with the hypotheses (A and B) in swapped order. In the pairs setup, the model outputs a plausibility score for each premise–hypothesis pair.
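The two setups above can be sketched as follows. This is a hypothetical illustration, not the authors' actual evaluation code: `ask_vlm` and `score_vlm` are assumed callables standing in for VLM queries, and the order-consistency check reflects the caption's description of presenting the triplet twice.

```python
# Hypothetical sketch of NL-Eye's two input setups (illustrative names only).

def triplet_prediction(ask_vlm, premise, hyp_a, hyp_b):
    """Triplet setup: show the premise with both hypotheses, twice with
    swapped order. `ask_vlm(premise, first, second)` is an assumed callable
    returning "first" or "second" for the more plausible hypothesis.
    Requiring both orderings to agree guards against positional bias."""
    run1 = ask_vlm(premise, hyp_a, hyp_b)   # A shown first
    run2 = ask_vlm(premise, hyp_b, hyp_a)   # B shown first
    pick1 = "A" if run1 == "first" else "B"
    pick2 = "B" if run2 == "first" else "A"
    return pick1 if pick1 == pick2 else None  # None = order-inconsistent


def pairs_prediction(score_vlm, premise, hyp_a, hyp_b):
    """Pairs setup: score each (premise, hypothesis) pair independently;
    the higher-scoring hypothesis is predicted as more plausible."""
    score_a = score_vlm(premise, hyp_a)
    score_b = score_vlm(premise, hyp_b)
    return "A" if score_a > score_b else "B"
```

The triplet setup lets the model compare hypotheses directly, while the pairs setup forces an absolute plausibility judgment per image; the consistency check in the triplet setup discards predictions that flip with presentation order.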
Main results: scores for vision-based experiments. VLMs are greatly outperformed by humans.
VLMs struggle with the Emotional and Functional categories but perform better on Social and Cultural ones and on parallel reasoning.
Text-based: Performance on plausibility prediction in the triplet setup. Predictor models perform well when given the gold descriptions.
Failure factors in model explanations for incorrect plausibility predictions.
Fully annotated example from the NL-Eye benchmark, featuring textual descriptions, three images, gold-standard explanations, reasoning categories, and temporal attributes (direction and duration).
BibTeX
@misc{ventura2024nleye,
title={NL-Eye: Abductive NLI for Images},
author={Mor Ventura and Michael Toker and Nitay Calderon and Zorik Gekhman and Yonatan Bitton and Roi Reichart},
year={2024},
eprint={2410.02613},
archivePrefix={arXiv},
primaryClass={cs.CV}
}