[2023/11] We have released a new version of VITATECS, generated using ChatGPT. The previous version, generated by OPT-175B, has been moved to the data_opt folder.
Data
This repo contains 6 jsonl files under the data folder, each corresponding to one aspect of temporal concepts (Direction, Intensity, Sequence, Localization, Compositionality, Type).
Each line of a jsonl file is a JSON object with the following fields:
src_dataset: the name of the source dataset (VATEX or MSRVTT)
video_name: the name of the video in the source dataset
caption: the original caption of the video
counterfactual: the generated counterfactual description of the video
aspect: the aspect of temporal concepts that is modified
Example (indented for better presentation):
{
  "src_dataset": "VATEX",
  "video_name": "i0ccSYMl0vo_000027_000037.mp4",
  "caption": "A woman is placing a waxing strip on a man's leg.",
  "counterfactual": "A woman is removing a waxing strip from a man's leg.",
  "aspect": "Direction"
}
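The format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the official tooling: the function names `load_jsonl` and `load_all` are hypothetical, and it assumes that every `*.jsonl` file directly under the data folder is one of the aspect files (the exact file names are not listed here, so the sketch groups records by their `aspect` field instead of by file name).

```python
import json
from pathlib import Path

# Field names taken from the field list in this README.
EXPECTED_FIELDS = {"src_dataset", "video_name", "caption", "counterfactual", "aspect"}


def load_jsonl(path):
    """Read one jsonl file and return its records as a list of dicts."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            missing = EXPECTED_FIELDS - record.keys()
            if missing:
                raise ValueError(f"{path}:{line_no} is missing fields: {sorted(missing)}")
            records.append(record)
    return records


def load_all(data_dir="data"):
    """Load every *.jsonl file under data_dir, grouped by the 'aspect' field.

    Assumption: all jsonl files directly under data_dir are aspect files.
    """
    by_aspect = {}
    for path in sorted(Path(data_dir).glob("*.jsonl")):
        for record in load_jsonl(path):
            by_aspect.setdefault(record["aspect"], []).append(record)
    return by_aspect
```

Calling `load_all("data")` from the repo root would return a dict mapping each aspect name to its list of caption/counterfactual records. Pointing it at `data_opt` should work the same way if that folder uses the same format, which is an assumption.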