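Every row in the preview below carries an `answers` sequence of ten free-form annotator responses, and they do not always agree. As a minimal sketch, and not something the dataset itself prescribes, the following tallies those responses to find the most common one. The helper name `consensus_answer` and the whitespace/case normalisation are assumptions made for illustration; the sample list is copied from the preview row with `question_id` 0.

```python
from collections import Counter


def consensus_answer(answers):
    """Return the most frequent answer and its share of all answers.

    Hypothetical helper: the normalisation below is an assumption,
    added because some preview answers carry trailing spaces.
    """
    cleaned = [a.strip().lower() for a in answers]
    answer, count = Counter(cleaned).most_common(1)[0]
    return answer, count / len(cleaned)


# Answers copied from the preview row with question_id 0.
row_answers = ["nokia", "nokia", "nokia", "nokia", "toshiba",
               "nokia", "nokia", "nokia", "nokia", "nokia"]

top, share = consensus_answer(row_answers)
print(top, share)
```

Taking the majority response is only one way to collapse the ten answers; rows where annotators split between variants such as "finn" and "finnair" may call for a softer agreement measure.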
image_id (string) | question_id (int32) | question (string) | question_tokens (sequence) | image (image) | image_width (int32) | image_height (int32) | flickr_original_url (string) | flickr_300k_url (string) | answers (sequence) | image_classes (sequence) | set_name (string) |
---|---|---|---|---|---|---|---|---|---|---|---|
"0054c91397f2fe05" | 0 | "what is the brand of phone?" | [
"what",
"is",
"the",
"brand",
"of",
"phone"
] | 1,024 | 730 | "https://farm6.staticflickr.com/2891/9134076951_f65b421097_o.jpg" | "https://c4.staticflickr.com/3/2891/9134076951_9db89d3e0f_z.jpg" | [
"nokia",
"nokia",
"nokia",
"nokia",
"toshiba",
"nokia",
"nokia",
"nokia",
"nokia",
"nokia"
] | [
"Belt",
"Headphones",
"Goggles",
"Scale",
"Bottle opener",
"Mobile phone",
"Mirror",
"Digital clock",
"Television",
"Telephone",
"Tool",
"Wheel",
"Camera",
"Watch",
"Glasses",
"Aircraft"
] | "train" |
|
"005635e119b9f32f" | 1 | "what type of plane is this?" | [
"what",
"type",
"of",
"plane",
"is",
"this"
] | 1,024 | 667 | "https://c8.staticflickr.com/3/2579/5811451782_31f8649055_o.jpg" | "https://c8.staticflickr.com/3/2579/5811451782_f6c4633327_z.jpg" | [
"lape",
"cargo",
"ec-agg",
"lape",
"lape",
"lape",
"lape",
"lape",
"lape",
"airplane"
] | [
"Vehicle",
"Helicopter",
"Airplane",
"Bomb",
"Aircraft"
] | "train" |
|
"005635e119b9f32f" | 2 | "what are the letters on the tail section of the plane?" | [
"what",
"are",
"the",
"letters",
"on",
"the",
"tail",
"section",
"of",
"the",
"plane"
] | 1,024 | 667 | "https://c8.staticflickr.com/3/2579/5811451782_31f8649055_o.jpg" | "https://c8.staticflickr.com/3/2579/5811451782_f6c4633327_z.jpg" | [
"ec agg",
"ec-agg",
"ec",
"ec-agg",
"ec",
"ec",
"ec",
"ec",
"ec goeland",
"ec"
] | [
"Vehicle",
"Helicopter",
"Airplane",
"Bomb",
"Aircraft"
] | "train" |
|
"00685bc495504d61" | 3 | "who is this copyrighted by?" | [
"who",
"is",
"this",
"copyrighted",
"by"
] | 786 | 1,024 | "https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg" | "https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg" | [
"simon clancy",
"simon ciancy",
"simon clancy",
"simon clancy",
"the brand is bayard",
"simon clancy",
"simon clancy",
"simon clancy",
"simon clancy",
"simon clancy"
] | [
"Vehicle",
"Tower",
"Airplane",
"Aircraft"
] | "train" |
|
"00685bc495504d61" | 4 | "what brand is on the plane?" | [
"what",
"brand",
"is",
"on",
"the",
"plane"
] | 786 | 1,024 | "https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg" | "https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg" | [
"virgin is the brand on the plane.",
"virgin mobile",
"virgin",
"virgin",
"virgin",
"virgin",
"virgin",
"virgin",
"virgin",
"virgin"
] | [
"Vehicle",
"Tower",
"Airplane",
"Aircraft"
] | "train" |
|
"006d10667d17b924" | 5 | "what year is shown in the photo?" | [
"what",
"year",
"is",
"shown",
"in",
"the",
"photo"
] | 1,024 | 768 | "https://farm5.staticflickr.com/6196/6118278128_09f43f09eb_o.jpg" | "https://c4.staticflickr.com/7/6196/6118278128_0fc8a3e349_z.jpg" | [
"2011",
"2011",
"2011",
"2011",
"2011",
"2011",
"2011",
"2011",
"2011",
"2011"
] | [
"Person",
"Woman",
"Man",
"Tree",
"Clothing",
"Airplane",
"Human face",
"Aircraft"
] | "train" |
|
"006d10667d17b924" | 6 | "what type of meeting is it?" | [
"what",
"type",
"of",
"meeting",
"is",
"it"
] | 1,024 | 768 | "https://farm5.staticflickr.com/6196/6118278128_09f43f09eb_o.jpg" | "https://c4.staticflickr.com/7/6196/6118278128_0fc8a3e349_z.jpg" | [
"rimini",
"royal theater",
"rimini",
"rimini",
"rimini meeting",
"rimini",
"rimini meeting",
"rimini",
"rimini",
"rimini"
] | [
"Person",
"Woman",
"Man",
"Tree",
"Clothing",
"Airplane",
"Human face",
"Aircraft"
] | "train" |
|
"00c359f294f7dcd9" | 7 | "what is the name of this plane?" | [
"what",
"is",
"the",
"name",
"of",
"this",
"plane"
] | 1,024 | 680 | "https://c2.staticflickr.com/9/8103/8562272505_2ce50b5a35_o.jpg" | "https://c1.staticflickr.com/9/8103/8562272505_5b42f9d199_z.jpg" | [
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco"
] | [
"Vehicle",
"Helicopter",
"Airplane",
"Aircraft"
] | "train" |
|
"00c359f294f7dcd9" | 8 | "what is the plane's call sign?" | [
"what",
"is",
"the",
"plane",
"'",
"s",
"call",
"sign"
] | 1,024 | 680 | "https://c2.staticflickr.com/9/8103/8562272505_2ce50b5a35_o.jpg" | "https://c1.staticflickr.com/9/8103/8562272505_5b42f9d199_z.jpg" | [
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco",
"g-atco"
] | [
"Vehicle",
"Helicopter",
"Airplane",
"Aircraft"
] | "train" |
|
"0122c5279a501df2" | 9 | "what letter is on the plane's tail?" | [
"what",
"letter",
"is",
"on",
"the",
"plane",
"'",
"s",
"tail"
] | 1,024 | 683 | "https://farm3.staticflickr.com/8673/16416250538_594bcb48d2_o.jpg" | "https://c7.staticflickr.com/9/8673/16416250538_9328760686_z.jpg" | [
"f",
"f",
"f",
"f",
"f",
"f",
"f",
"f",
"f",
"f"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0122c5279a501df2" | 10 | "what airline is this?" | [
"what",
"airline",
"is",
"this"
] | 1,024 | 683 | "https://farm3.staticflickr.com/8673/16416250538_594bcb48d2_o.jpg" | "https://c7.staticflickr.com/9/8673/16416250538_9328760686_z.jpg" | [
"finn",
"finn",
"finn air",
"finnair",
"finn",
"finnair",
"finn",
"finnair",
"finnair",
"unanswerable"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"015c537f722d115e" | 11 | "what airline is this plane for?" | [
"what",
"airline",
"is",
"this",
"plane",
"for"
] | 1,024 | 448 | "https://c7.staticflickr.com/3/2682/4481270420_2b2e163583_o.jpg" | "https://c8.staticflickr.com/3/2682/4481270420_728ee098f0_z.jpg" | [
"airfrance",
"air france",
"airfrance",
"air france",
"airfrance",
"airfrance",
"spanish",
"air france",
"air france",
"air france"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0202faf23a9aae11" | 12 | "what does it say on the plane?" | [
"what",
"does",
"it",
"say",
"on",
"the",
"plane"
] | 1,024 | 1,024 | "https://farm2.staticflickr.com/5473/9450199255_515b079639_o.jpg" | "https://c1.staticflickr.com/6/5473/9450199255_43029a3cd1_z.jpg" | [
"croatia",
"croatia",
"roatia",
"roatia",
"croatia",
"croatia",
"croatia",
"croatia",
"roatia",
"roatia "
] | [
"Building",
"Person",
"Vehicle",
"Airplane",
"Human face",
"Aircraft"
] | "train" |
|
"02b2daa20b00f5d3" | 13 | "what website is this jet associated with?" | [
"what",
"website",
"is",
"this",
"jet",
"associated",
"with"
] | 1,024 | 682 | "https://farm4.staticflickr.com/7403/13914609743_2004ef8b74_o.jpg" | "https://c3.staticflickr.com/8/7403/13914609743_23fdb967b3_z.jpg" | [
"jet2.com",
"jet2.com",
"jet2.com",
"jet2.com",
"jet2",
"jet2.com",
"jet2.com",
"jet2.com",
"jet2",
"jet2.com"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"02b2daa20b00f5d3" | 14 | "what does this jet advertise as having?" | [
"what",
"does",
"this",
"jet",
"advertise",
"as",
"having"
] | 1,024 | 682 | "https://farm4.staticflickr.com/7403/13914609743_2004ef8b74_o.jpg" | "https://c3.staticflickr.com/8/7403/13914609743_23fdb967b3_z.jpg" | [
"friendly low fares",
"friendly low fares",
"friendly low fares",
"friendly low fares",
"friendly low fares",
"friendly low fares",
"friendly low fares",
"friendly low fairs",
"friendly low fares",
"friendly low fares"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"02da7a42e971e3cb" | 15 | "what letters are embellished on the parachute?" | [
"what",
"letters",
"are",
"embellished",
"on",
"the",
"parachute"
] | 1,024 | 903 | "https://c4.staticflickr.com/5/4102/4759963602_47a83be793_o.jpg" | "https://c4.staticflickr.com/5/4102/4759963602_5ea2beab2f_z.jpg" | [
"raf",
"raf",
"raf",
"raf",
"r a f",
"raf",
"raf",
"raf",
"raf",
"raf"
] | [
"Vehicle",
"Helicopter",
"Parachute",
"Aircraft"
] | "train" |
|
"02da7a42e971e3cb" | 16 | "what are the letters being displayed?" | [
"what",
"are",
"the",
"letters",
"being",
"displayed"
] | 1,024 | 903 | "https://c4.staticflickr.com/5/4102/4759963602_47a83be793_o.jpg" | "https://c4.staticflickr.com/5/4102/4759963602_5ea2beab2f_z.jpg" | [
"raf",
"raf",
"raf",
"raf",
"raf",
"raf",
"raf",
"raf",
"raf",
"raf"
] | [
"Vehicle",
"Helicopter",
"Parachute",
"Aircraft"
] | "train" |
|
"0300066daef7157a" | 17 | "what is the last number of the plane?" | [
"what",
"is",
"the",
"last",
"number",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c3.staticflickr.com/6/5102/5650736526_9ec0ea741b_o.jpg" | "https://c2.staticflickr.com/6/5102/5650736526_28c63333c5_z.jpg" | [
"3",
"3",
"3",
"traktor",
"3",
"3",
"3",
"3",
"3",
"3"
] | [
"Vehicle",
"Ambulance",
"Airplane",
"Aircraft"
] | "train" |
|
"0320299003ff65fb" | 18 | "what letters are visible on the tail of the helicopter?" | [
"what",
"letters",
"are",
"visible",
"on",
"the",
"tail",
"of",
"the",
"helicopter"
] | 1,024 | 584 | "https://c4.staticflickr.com/9/8003/7476657322_eb2e6cf61c_o.jpg" | "https://c7.staticflickr.com/9/8003/7476657322_225c533aae_z.jpg" | [
"adx",
"danger",
"ad",
"adx",
"adx",
"adx",
"adx",
"adx",
"adx",
"adx"
] | [
"Person",
"Vehicle",
"Helicopter",
"Footwear",
"Airplane",
"Aircraft"
] | "train" |
|
"0393c9d77b8215a3" | 19 | "what is the plane name?" | [
"what",
"is",
"the",
"plane",
"name"
] | 1,024 | 532 | "https://farm4.staticflickr.com/7317/8720879958_791373f9ce_o.jpg" | "https://c1.staticflickr.com/8/7317/8720879958_85e108e726_z.jpg" | [
"728tfw",
"f-16",
"wp",
"wp",
"wp",
"wp",
"wp",
"wp",
"wp",
"unanswerable"
] | [
"Vehicle",
"Boat",
"Clock",
"Airplane",
"Aircraft"
] | "train" |
|
"0393c9d77b8215a3" | 20 | "what is the plane number?" | [
"what",
"is",
"the",
"plane",
"number"
] | 1,024 | 532 | "https://farm4.staticflickr.com/7317/8720879958_791373f9ce_o.jpg" | "https://c1.staticflickr.com/8/7317/8720879958_85e108e726_z.jpg" | [
"728",
"728tfw",
"81",
"81728",
"728tfw",
"unanswerable",
"af91728tfw",
"728tfw",
"8",
"wp"
] | [
"Vehicle",
"Boat",
"Clock",
"Airplane",
"Aircraft"
] | "train" |
|
"0394933dcd965048" | 21 | "what does the train say?" | [
"what",
"does",
"the",
"train",
"say"
] | 1,024 | 769 | "https://c1.staticflickr.com/8/7271/7013089685_34455c39b6_o.jpg" | "https://c8.staticflickr.com/8/7271/7013089685_e28a82e463_z.jpg" | [
"ricklinghausen",
"rocklinghausen ",
"recklinghausen",
"rockinghausen",
"rocklinghausen",
"recklinghausen ",
"rocklinghausen",
"rocklinghausen",
"recklkinghausen",
"recklinghausen "
] | [
"Train",
"Boy",
"Person",
"Woman",
"Man",
"Vehicle",
"Clothing",
"Footwear",
"Airplane",
"Aircraft"
] | "train" |
|
"03e38d16213c4067" | 22 | "what organization do these men work for?" | [
"what",
"organization",
"do",
"these",
"men",
"work",
"for"
] | 1,024 | 682 | "https://farm8.staticflickr.com/3383/3184901622_cf5f09010a_o.jpg" | "https://c3.staticflickr.com/4/3383/3184901622_ff62bb323c_z.jpg" | [
"politi",
"politi",
"politi",
"politi",
"politi",
"politi",
"politi",
"politi",
"politi",
"police"
] | [
"Bus",
"Ambulance",
"Person",
"Land vehicle",
"Stretcher",
"Vehicle",
"Auto part",
"Van",
"Tire",
"Car",
"Aircraft"
] | "train" |
|
"03e38d16213c4067" | 23 | "what two numbers can be seen on the white post on the right?" | [
"what",
"two",
"numbers",
"can",
"be",
"seen",
"on",
"the",
"white",
"post",
"on",
"the",
"right"
] | 1,024 | 682 | "https://farm8.staticflickr.com/3383/3184901622_cf5f09010a_o.jpg" | "https://c3.staticflickr.com/4/3383/3184901622_ff62bb323c_z.jpg" | [
"23",
"25",
"25 50",
"unanswerable",
"25, 50 ",
"25 and 50",
"25, 50",
"25, 50",
"23 50",
"25-50"
] | [
"Bus",
"Ambulance",
"Person",
"Land vehicle",
"Stretcher",
"Vehicle",
"Auto part",
"Van",
"Tire",
"Car",
"Aircraft"
] | "train" |
|
"0492ee5ac9a69515" | 24 | "what letters are on the craft?" | [
"what",
"letters",
"are",
"on",
"the",
"craft"
] | 1,024 | 683 | "https://farm6.staticflickr.com/2909/14180467188_8880ca1448_o.jpg" | "https://c6.staticflickr.com/3/2909/14180467188_5d7ab99262_z.jpg" | [
"f-pdhv",
"f-pdhv",
"f-pdhv",
"f-pdhv",
"f-pdhv",
"fpdhv",
"f-pdhv",
"f-pdhv",
"f-pdhv",
"f-pdhv"
] | [
"Person",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0492ee5ac9a69515" | 25 | "what website is labeled on the plane?" | [
"what",
"website",
"is",
"labeled",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://farm6.staticflickr.com/2909/14180467188_8880ca1448_o.jpg" | "https://c6.staticflickr.com/3/2909/14180467188_5d7ab99262_z.jpg" | [
"fpdhv",
"www.verheesengineering.com",
"unanswerable",
"verhesengineering.com",
"unanswerable",
"verheesenginering.com",
"www.verheesengineering.com",
"www.verhwwsengineering.com",
"2",
"www.verheesengineering.com"
] | [
"Person",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"04dfad38d434409c" | 26 | "what happens if you pull?" | [
"what",
"happens",
"if",
"you",
"pull"
] | 973 | 1,024 | "https://c5.staticflickr.com/4/3315/3411908910_a8ecbfcaeb_o.jpg" | "https://c8.staticflickr.com/4/3315/3411908910_4c2f8e5fbc_z.jpg?zz=1" | [
"stop",
"emergency stop",
"emergency stop",
"stop",
"stop",
"emergency stop",
"emergency stop",
"it is an emergency stop",
"emergency stop",
"unanswerable"
] | [
"Land vehicle",
"Vehicle",
"Airplane",
"Car",
"Aircraft"
] | "train" |
|
"04dfad38d434409c" | 27 | "what does the sticker on the window say?" | [
"what",
"does",
"the",
"sticker",
"on",
"the",
"window",
"say"
] | 973 | 1,024 | "https://c5.staticflickr.com/4/3315/3411908910_a8ecbfcaeb_o.jpg" | "https://c8.staticflickr.com/4/3315/3411908910_4c2f8e5fbc_z.jpg?zz=1" | [
"aa",
"aa",
"aa",
"aa",
"aa",
"the sticker says emergency stop pull.",
"aa",
"emergency stop pull",
"aa",
"aa"
] | [
"Land vehicle",
"Vehicle",
"Airplane",
"Car",
"Aircraft"
] | "train" |
|
"05a4fa87bfdaa243" | 28 | "what made this airplane?" | [
"what",
"made",
"this",
"airplane"
] | 1,024 | 682 | "https://farm4.staticflickr.com/59/200317024_00b0454199_o.jpg" | "https://c1.staticflickr.com/1/59/200317024_00b0454199_z.jpg?zz=1" | [
"biman",
"biman",
"biman",
"biman",
"biman",
"biman",
"biman",
"biman",
"biman",
"biman"
] | [
"Vehicle",
"Rocket",
"Airplane",
"Aircraft"
] | "train" |
|
"05a4fa87bfdaa243" | 29 | "what kind of plane is this?" | [
"what",
"kind",
"of",
"plane",
"is",
"this"
] | 1,024 | 682 | "https://farm4.staticflickr.com/59/200317024_00b0454199_o.jpg" | "https://c1.staticflickr.com/1/59/200317024_00b0454199_z.jpg?zz=1" | [
"biman",
"biman",
"bitman",
"biman",
"biman",
"dc 10-30",
"biman",
"biman",
"biman",
"biman"
] | [
"Vehicle",
"Rocket",
"Airplane",
"Aircraft"
] | "train" |
|
"05e8721b42640337" | 30 | "what are the numbers on the plane?" | [
"what",
"are",
"the",
"numbers",
"on",
"the",
"plane"
] | 1,024 | 681 | "https://farm8.staticflickr.com/5473/11998619305_9887d267f3_o.jpg" | "https://c6.staticflickr.com/6/5473/11998619305_f909aa2fc2_z.jpg" | [
"n337am",
"n337am",
"n337am",
"n337am",
"337",
"n337am",
"337",
"337",
"n337am",
"337"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"06628eaed257fa24" | 31 | "what word does one of the hot balloons feature?" | [
"what",
"word",
"does",
"one",
"of",
"the",
"hot",
"balloons",
"feature"
] | 1,024 | 683 | "https://farm5.staticflickr.com/5546/10262386264_6915b9587b_o.jpg" | "https://c3.staticflickr.com/6/5546/10262386264_f40d3fc168_z.jpg" | [
"when",
"when",
"yamaha",
"when",
"when",
"when pigs fly",
"bobby j's",
"when",
"when",
"when apes fly"
] | [
"Vehicle",
"Balloon",
"Parachute",
"Aircraft"
] | "train" |
|
"073f668cdc671c37" | 32 | "where is the plane going?" | [
"where",
"is",
"the",
"plane",
"going"
] | 1,024 | 683 | "https://farm8.staticflickr.com/8292/7496758474_eea4bc6745_o.jpg" | "https://c3.staticflickr.com/9/8292/7496758474_ef1827aaff_z.jpg" | [
"south african",
"worth",
"unanswerable",
"unanswerable",
"unanswerable",
"unanswerable",
"south africa",
"south africa",
"south africa",
"answering does not require reading text in the image"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"073f668cdc671c37" | 33 | "what type of plane is this?" | [
"what",
"type",
"of",
"plane",
"is",
"this"
] | 1,024 | 683 | "https://farm8.staticflickr.com/8292/7496758474_eea4bc6745_o.jpg" | "https://c3.staticflickr.com/9/8292/7496758474_ef1827aaff_z.jpg" | [
"south african",
"hidehi matsui",
"south african",
"south african",
"south african",
"south african",
"south african ",
"south africa",
"south african",
"707"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0782a21a2538f868" | 34 | "what does the white sign say?" | [
"what",
"does",
"the",
"white",
"sign",
"say"
] | 1,024 | 408 | "https://c1.staticflickr.com/3/2645/3891762410_be9ed7d0c3_o.jpg" | "https://c7.staticflickr.com/3/2645/3891762410_3fe4d1e701_z.jpg" | [
"a380",
"airbus a380",
"unanswerable",
"unanswerable",
"unanswerable",
"airbus a380",
"airbus",
"airbus",
"unanswerable",
"airbus"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0782a21a2538f868" | 35 | "what model of plane is this?" | [
"what",
"model",
"of",
"plane",
"is",
"this"
] | 1,024 | 408 | "https://c1.staticflickr.com/3/2645/3891762410_be9ed7d0c3_o.jpg" | "https://c7.staticflickr.com/3/2645/3891762410_3fe4d1e701_z.jpg" | [
"airbus",
"a380",
"airbus a380",
"airbus a380",
"airbus a380",
"macbook air ",
"airbus",
"airbus a380",
"airbus a380",
"airbus"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"080858916c6e0c6c" | 36 | "what number is on the plane?" | [
"what",
"number",
"is",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://c5.staticflickr.com/7/6150/5937848980_881ac05f0c_o.jpg" | "https://c8.staticflickr.com/7/6150/5937848980_4ae6e52643_z.jpg" | [
"41",
"41",
"41",
"41",
"41",
"41",
"41",
"41",
"41",
"41"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"080858916c6e0c6c" | 37 | "is there any other text on the plane?" | [
"is",
"there",
"any",
"other",
"text",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://c5.staticflickr.com/7/6150/5937848980_881ac05f0c_o.jpg" | "https://c8.staticflickr.com/7/6150/5937848980_4ae6e52643_z.jpg" | [
"yes",
"yes",
"41",
"yes, luk",
"41",
"yes",
"yes",
"yes",
"41",
"yes"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"08d8e9951c1ea69e" | 38 | "what is the number and letter of the plane?" | [
"what",
"is",
"the",
"number",
"and",
"letter",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c7.staticflickr.com/6/5238/5878712742_ef6f46119e_o.jpg" | "https://c1.staticflickr.com/6/5238/5878712742_c5ce03dd9d_z.jpg" | [
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kf",
"n328kff"
] | [
"Table",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"08d8e9951c1ea69e" | 39 | "which company owns that aircraft?" | [
"which",
"company",
"owns",
"that",
"aircraft"
] | 1,024 | 683 | "https://c7.staticflickr.com/6/5238/5878712742_ef6f46119e_o.jpg" | "https://c1.staticflickr.com/6/5238/5878712742_c5ce03dd9d_z.jpg" | [
"scaled composites",
"spaceshipone",
"unanswerable",
"scaled composites",
"spaceshipone",
"spaceshipone",
"scaled composites ",
"scaled",
"space ship one",
"scaled"
] | [
"Table",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"096bce1a91c1624c" | 40 | "to which organization does this helicopter belong?" | [
"to",
"which",
"organization",
"does",
"this",
"helicopter",
"belong"
] | 1,024 | 683 | "https://c5.staticflickr.com/5/4101/4912152432_95b0f3f317_o.jpg" | "https://c6.staticflickr.com/5/4101/4912152432_a134f2da27_z.jpg" | [
"department of public safety",
"public safety",
"department of public safety ",
"public safety",
"public safety",
"public safety",
"department of public safety",
"department of public safety",
"utah department of public safety",
"department of public safety"
] | [
"Vehicle",
"Helicopter",
"Aircraft"
] | "train" |
|
"09d87dca721d23be" | 41 | "what kind of airline is this?" | [
"what",
"kind",
"of",
"airline",
"is",
"this"
] | 1,024 | 683 | "https://c7.staticflickr.com/9/8065/8253258734_c09a5fba2f_o.jpg" | "https://c2.staticflickr.com/9/8065/8253258734_f5c7d25b95_z.jpg" | [
"lufthansa ",
"lufthansa cargo",
"lufthansa cargo",
"lufthansa cargo",
"luthansa cargo",
"lufthansa cargo",
"lufthansa",
"lufthansa cargo",
"lufthansa cargo",
"luthsana cargo"
] | [
"Doll",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0b187390a96228ff" | 42 | "what brand is this phone?" | [
"what",
"brand",
"is",
"this",
"phone"
] | 1,024 | 768 | "https://c8.staticflickr.com/4/3267/2841854688_5b102a5f47_o.jpg" | "https://c8.staticflickr.com/4/3267/2841854688_174d48f551_z.jpg?zz=1" | [
"fedex",
"unanswerable",
"fedex",
"unanswerable",
"unanswerable",
"fedex",
"unanswerable",
"unanswerable",
"fedex",
"unanswerable"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0b187390a96228ff" | 43 | "is the word express under the word fedex at the front of the plane?" | [
"is",
"the",
"word",
"express",
"under",
"the",
"word",
"fedex",
"at",
"the",
"front",
"of",
"the",
"plane"
] | 1,024 | 768 | "https://c8.staticflickr.com/4/3267/2841854688_5b102a5f47_o.jpg" | "https://c8.staticflickr.com/4/3267/2841854688_174d48f551_z.jpg?zz=1" | [
"yes",
"express",
"yes",
"yes",
"yes",
"express",
"yes",
"yes",
"yes",
"yes"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0b2f523a4e734bec" | 44 | "what is the plane number?" | [
"what",
"is",
"the",
"plane",
"number"
] | 1,024 | 683 | "https://c5.staticflickr.com/8/7118/7768063976_fdc66843c6_o.jpg" | "https://c1.staticflickr.com/8/7118/7768063976_ea9351e86f_z.jpg" | [
"ec-634",
"ec-634",
"ec-634",
"ec-634",
"ec-354",
"ec-634",
"ec-634",
"ec-634",
"the taste of freedom",
"634"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0b2f523a4e734bec" | 45 | "what is the name of the plane?" | [
"what",
"is",
"the",
"name",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c5.staticflickr.com/8/7118/7768063976_fdc66843c6_o.jpg" | "https://c1.staticflickr.com/8/7118/7768063976_ea9351e86f_z.jpg" | [
"inta",
"ec-634",
"inta",
"not all those who wander are lost",
"inta",
"inta",
"ec-634",
"inta",
"inta",
"inta ec-634"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0e40cb14d4c4bbbc" | 46 | "what is the id number written on the rear of the plane?" | [
"what",
"is",
"the",
"id",
"number",
"written",
"on",
"the",
"rear",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c5.staticflickr.com/6/5448/9320917774_4f524cc61e_o.jpg" | "https://c5.staticflickr.com/6/5448/9320917774_952331468c_z.jpg" | [
"d-chio",
"d-chi0",
"0",
"0chio",
"d-chio",
"d-chio",
"d-chio",
"d-chio",
"d-chio",
"d-chio"
] | [
"Tree",
"Vehicle",
"Clothing",
"Footwear",
"Airplane",
"Aircraft"
] | "train" |
|
"0e45202f3462f604" | 47 | "what company is this for?" | [
"what",
"company",
"is",
"this",
"for"
] | 1,024 | 768 | "https://farm5.staticflickr.com/2081/2478405105_c9ea1fa2f9_o.jpg" | "https://c6.staticflickr.com/3/2081/2478405105_37f0295257_z.jpg" | [
"java",
"java",
"java",
"java",
"java",
"java",
"java",
"java",
"jave",
"java"
] | [
"Vehicle",
"Human eye",
"Person",
"Human mouth",
"Human ear",
"Human hair",
"Human head",
"Man",
"Airplane",
"Human face",
"Human nose",
"Aircraft"
] | "train" |
|
"0e45202f3462f604" | 48 | "what does the sign want to add to java?" | [
"what",
"does",
"the",
"sign",
"want",
"to",
"add",
"to",
"java"
] | 1,024 | 768 | "https://farm5.staticflickr.com/2081/2478405105_c9ea1fa2f9_o.jpg" | "https://c6.staticflickr.com/3/2081/2478405105_37f0295257_z.jpg" | [
"you",
"you",
"you",
"you",
"you",
"you",
"you",
"you",
"you",
"you"
] | [
"Vehicle",
"Human eye",
"Person",
"Human mouth",
"Human ear",
"Human hair",
"Human head",
"Man",
"Airplane",
"Human face",
"Human nose",
"Aircraft"
] | "train" |
|
"0fd38fd0f9bf1bee" | 49 | "the train number is?" | [
"the",
"train",
"number",
"is"
] | 1,024 | 764 | "https://c7.staticflickr.com/4/3727/19693966404_b5cacb3d24_o.jpg" | "https://c6.staticflickr.com/4/3727/19693966404_90016a2fce_z.jpg" | [
"1450",
"1450",
"may 10th 2009",
"1450",
"1450",
"1450",
"450",
"1450",
"1450",
"1450"
] | [
"Vehicle",
"Land vehicle",
"Plant",
"Train",
"Wheel",
"Auto part",
"Aircraft"
] | "train" |
|
"0fef083ac0b5dff6" | 50 | "what letter is written on the side of the red propeller plane?" | [
"what",
"letter",
"is",
"written",
"on",
"the",
"side",
"of",
"the",
"red",
"propeller",
"plane"
] | 1,024 | 768 | "https://farm7.staticflickr.com/3062/2888363565_8f3e5ef325_o.jpg" | "https://c5.staticflickr.com/4/3062/2888363565_1ae375f3ac_z.jpg?zz=1" | [
"a",
"unanswerable",
"unanswerable",
"618",
"no text in image",
"a",
"a",
"unanswerable",
"unanswerable",
"unanswerable"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"0fef083ac0b5dff6" | 51 | "what 3 digit number is on the plane in the back?" | [
"what",
"3",
"digit",
"number",
"is",
"on",
"the",
"plane",
"in",
"the",
"back"
] | 1,024 | 768 | "https://farm7.staticflickr.com/3062/2888363565_8f3e5ef325_o.jpg" | "https://c5.staticflickr.com/4/3062/2888363565_1ae375f3ac_z.jpg?zz=1" | [
"987",
"987",
"pepsi",
"987",
"987",
"612",
"987",
"987",
"987",
"997"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"10513fda50da5e6d" | 52 | "what is the id number on the tail of the plane?" | [
"what",
"is",
"the",
"id",
"number",
"on",
"the",
"tail",
"of",
"the",
"plane"
] | 1,024 | 768 | "https://c6.staticflickr.com/4/3236/3113140447_9a14368487_o.jpg" | "https://c1.staticflickr.com/4/3236/3113140447_7a98021e3c_z.jpg" | [
"513",
"513",
"513",
"513",
"xt5v3",
"513",
"513",
"513",
"513",
"513"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1086a31c9367a124" | 53 | "what is the word on the side of this plane?" | [
"what",
"is",
"the",
"word",
"on",
"the",
"side",
"of",
"this",
"plane"
] | 1,024 | 682 | "https://farm5.staticflickr.com/1008/1313880307_2c59c70ffa_o.jpg" | "https://c3.staticflickr.com/2/1008/1313880307_0734ab83fb_z.jpg" | [
"team ameristar.com",
"teamaerostar.com",
"teamaerostar.com",
"teamaerostar.com",
"teamaerostar.com",
"34",
"teamaerostar",
"teamaerostar.com",
"teamaerostar",
"teamaerostar.com"
] | [
"Person",
"Land vehicle",
"Vehicle",
"Airplane",
"Car",
"Aircraft"
] | "train" |
|
"1086a31c9367a124" | 54 | "what number is this plane?" | [
"what",
"number",
"is",
"this",
"plane"
] | 1,024 | 682 | "https://farm5.staticflickr.com/1008/1313880307_2c59c70ffa_o.jpg" | "https://c3.staticflickr.com/2/1008/1313880307_0734ab83fb_z.jpg" | [
"34",
"34",
"34",
"34",
"34",
"34",
"34",
"34",
"34",
"34"
] | [
"Person",
"Land vehicle",
"Vehicle",
"Airplane",
"Car",
"Aircraft"
] | "train" |
|
"111d7be56517ed46" | 55 | "what is written on the side of this airplane?" | [
"what",
"is",
"written",
"on",
"the",
"side",
"of",
"this",
"airplane"
] | 1,024 | 683 | "https://farm7.staticflickr.com/7420/9489993636_a69b8334dc_o.jpg" | "https://c4.staticflickr.com/8/7420/9489993636_0d401d5a13_z.jpg" | [
"cs-dkd",
"cs-dkd",
"cs-dkd",
"cs-dkd",
"cs-dkd",
"c9-dkd",
"cs-dkd",
"cs-oko",
"cs-dkd",
"cs-dkd"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"111d7be56517ed46" | 56 | "what number is on the plane?" | [
"what",
"number",
"is",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://farm7.staticflickr.com/7420/9489993636_a69b8334dc_o.jpg" | "https://c4.staticflickr.com/8/7420/9489993636_0d401d5a13_z.jpg" | [
"csdkd",
"cs-dk0",
"8",
"cs-0k0",
"no number",
"cs-dkd",
"unanswerable",
"cs-dkd",
"cs-dkd",
"xs-dkd"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1139456aa3f70a34" | 57 | "what number is on the silver plane?" | [
"what",
"number",
"is",
"on",
"the",
"silver",
"plane"
] | 1,024 | 680 | "https://farm5.staticflickr.com/5282/5304752422_73f0c89306_o.jpg" | "https://c6.staticflickr.com/6/5282/5304752422_796df24693_z.jpg" | [
"63",
"63",
"63",
"63",
"63",
"63",
"63",
"63",
"63",
"63"
] | [
"Vehicle",
"Clothing",
"Airplane",
"Aircraft"
] | "train" |
|
"1252873867aa8a86" | 58 | "what identification number belongs to the big bomber plane in the center?" | [
"what",
"identification",
"number",
"belongs",
"to",
"the",
"big",
"bomber",
"plane",
"in",
"the",
"center"
] | 1,024 | 681 | "https://farm7.staticflickr.com/4099/4879614318_cce9817c94_o.jpg" | "https://c7.staticflickr.com/5/4099/4879614318_6c19f98122_z.jpg" | [
"82",
"82",
"82",
"82",
"82",
"82",
"82",
"82",
"82",
"82"
] | [
"Vehicle",
"Wheel",
"Airplane",
"Aircraft"
] | "train" |
|
"1252873867aa8a86" | 59 | "what letter is in the black circle?" | [
"what",
"letter",
"is",
"in",
"the",
"black",
"circle"
] | 1,024 | 681 | "https://farm7.staticflickr.com/4099/4879614318_cce9817c94_o.jpg" | "https://c7.staticflickr.com/5/4099/4879614318_6c19f98122_z.jpg" | [
"r",
"r",
"r",
"r",
"r",
"r",
"r",
"r",
"r",
"r"
] | [
"Vehicle",
"Wheel",
"Airplane",
"Aircraft"
] | "train" |
|
"12e4f6ee148a112e" | 60 | "what airline is this plane for?" | [
"what",
"airline",
"is",
"this",
"plane",
"for"
] | 1,024 | 683 | "https://c7.staticflickr.com/9/8514/8570025824_708ec5043f_o.jpg" | "https://c6.staticflickr.com/9/8514/8570025824_1108160a25_z.jpg" | [
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"nurair",
"onurair",
"onurair",
"onurair"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"12e4f6ee148a112e" | 61 | "what is the name of the airline?" | [
"what",
"is",
"the",
"name",
"of",
"the",
"airline"
] | 1,024 | 683 | "https://c7.staticflickr.com/9/8514/8570025824_708ec5043f_o.jpg" | "https://c6.staticflickr.com/9/8514/8570025824_1108160a25_z.jpg" | [
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair",
"onurair"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1334b6b8fcfd8afb" | 62 | "what number is on the grey and yellow plane?" | [
"what",
"number",
"is",
"on",
"the",
"grey",
"and",
"yellow",
"plane"
] | 1,024 | 737 | "https://c4.staticflickr.com/4/3213/5783769965_095c1521cc_o.jpg" | "https://c1.staticflickr.com/4/3213/5783769965_c47052b1b9_z.jpg" | [
"481",
"481",
"48i",
"481",
"481",
"481",
"481",
"481",
"481",
"481"
] | [
"Person",
"Vehicle",
"Building",
"Airplane",
"Aircraft"
] | "train" |
|
"13dec395531b458f" | 63 | "two letters on the tail?" | [
"two",
"letters",
"on",
"the",
"tail"
] | 1,024 | 683 | "https://c8.staticflickr.com/4/3423/3254044550_01df334a26_o.jpg" | "https://c1.staticflickr.com/4/3423/3254044550_9937e4cbff_z.jpg" | [
"ah",
"aj",
"ah",
"ah",
"ah",
"ah",
"ah",
"ah",
"ah",
"ah"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"13dec395531b458f" | 64 | "what is the number found on the head of the plane?" | [
"what",
"is",
"the",
"number",
"found",
"on",
"the",
"head",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c8.staticflickr.com/4/3423/3254044550_01df334a26_o.jpg" | "https://c1.staticflickr.com/4/3423/3254044550_9937e4cbff_z.jpg" | [
"106",
"106",
"106",
"106",
"106",
"106",
"106",
"106",
"106",
"106"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"142076607dbdfa59" | 65 | "is that plane part of the star alliance?" | [
"is",
"that",
"plane",
"part",
"of",
"the",
"star",
"alliance"
] | 1,024 | 683 | "https://c2.staticflickr.com/6/5618/20923572869_97e8af2aff_o.jpg" | "https://c5.staticflickr.com/6/5618/20923572869_957a38f2d4_z.jpg" | [
"yes",
"yes",
"star alliance",
"yes",
"yes",
"yes",
"sunkist",
"yes",
"yes",
"stop"
] | [
"Building",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"14fa38ae8fea318d" | 66 | "who owns the blimp?" | [
"who",
"owns",
"the",
"blimp"
] | 1,024 | 681 | "https://c6.staticflickr.com/8/7298/13898426779_e9bda6bb6f_o.jpg" | "https://c2.staticflickr.com/8/7298/13898426779_dcce47bbd3_z.jpg" | [
"u.s. navy",
"u.s. navy",
"u.s. navy",
"us navy",
"u.s. navy",
"u.s. navy",
"u.s. navy",
"us navy",
"us navy",
"us navy"
] | [
"Vehicle",
"Aircraft"
] | "train" |
|
"1582c8538686d50d" | 67 | "what number is on the plane?" | [
"what",
"number",
"is",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://c6.staticflickr.com/8/7270/7733381602_7db0f9bc57_o.jpg" | "https://c5.staticflickr.com/8/7270/7733381602_60faa5782c_z.jpg" | [
"155",
"155",
"155",
"155",
"155",
"55",
"155",
"155",
"155",
"155"
] | [
"Person",
"Vehicle",
"Clothing",
"Footwear",
"Airplane",
"Aircraft"
] | "train" |
|
"15a2517d9e904afa" | 68 | "what airline does this plane belong to?" | [
"what",
"airline",
"does",
"this",
"plane",
"belong",
"to"
] | 1,024 | 683 | "https://c3.staticflickr.com/4/3904/14622637909_597d459db0_o.jpg" | "https://c5.staticflickr.com/4/3904/14622637909_f25e608e95_z.jpg" | [
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia ",
"iberia"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"15a48b2dd565998a" | 69 | "what is this?" | [
"what",
"is",
"this"
] | 1,024 | 683 | "https://c3.staticflickr.com/5/4115/4825846889_fda143463b_o.jpg" | "https://c2.staticflickr.com/5/4115/4825846889_35ab6efbe0_z.jpg" | [
"an a380 airbus",
"airbus",
"answering does not require reading text in the image",
"airbus",
"airbus a380",
"airbus",
"airplane",
"airplane",
"answering does not require reading text in the image",
"answering does not require reading text in the image"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"15a48b2dd565998a" | 70 | "what is the identification number?" | [
"what",
"is",
"the",
"identification",
"number"
] | 1,024 | 683 | "https://c3.staticflickr.com/5/4115/4825846889_fda143463b_o.jpg" | "https://c2.staticflickr.com/5/4115/4825846889_35ab6efbe0_z.jpg" | [
"a380",
"a380",
"a380",
"a380",
"a380",
"a380",
"a380",
"a380",
"a380",
"a380"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"15da5c794efe07b6" | 71 | "what is this planes website?" | [
"what",
"is",
"this",
"planes",
"website"
] | 1,024 | 582 | "https://farm7.staticflickr.com/5594/14944706458_b4993e9894_o.jpg" | "https://c8.staticflickr.com/6/5594/14944706458_99886c5f94_z.jpg" | [
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monarch.co.uk",
"monach.co.uk",
"monarch.co.uk"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"15da5c794efe07b6" | 72 | "what letter is the symbol on the tale of the plane?" | [
"what",
"letter",
"is",
"the",
"symbol",
"on",
"the",
"tale",
"of",
"the",
"plane"
] | 1,024 | 582 | "https://farm7.staticflickr.com/5594/14944706458_b4993e9894_o.jpg" | "https://c8.staticflickr.com/6/5594/14944706458_99886c5f94_z.jpg" | [
"m",
"m",
"m",
"m",
"m",
"m",
"m",
"m",
"m",
"m"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"163c6f54edee23ae" | 73 | "what does it say on the jet?" | [
"what",
"does",
"it",
"say",
"on",
"the",
"jet"
] | 1,024 | 680 | "https://c7.staticflickr.com/8/7393/8727264764_9888660299_o.jpg" | "https://c3.staticflickr.com/8/7393/8727264764_04b6359b61_z.jpg" | [
"u.s. airforce",
"us air force",
"u.s. air force",
"us air force",
"u.s. air force",
"us air force",
"u.s. air force",
"us air force",
"us air force",
"us air force"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"163c6f54edee23ae" | 74 | "what branch of the military is the plane from?" | [
"what",
"branch",
"of",
"the",
"military",
"is",
"the",
"plane",
"from"
] | 1,024 | 680 | "https://c7.staticflickr.com/8/7393/8727264764_9888660299_o.jpg" | "https://c3.staticflickr.com/8/7393/8727264764_04b6359b61_z.jpg" | [
"us air force",
"u.s. air force",
"air force",
"air force",
"air force",
"us air force ",
"u.s. air force",
"us air force",
"air force",
"u.s. airforce"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1665ff941d73a03f" | 75 | "what is the plane number?" | [
"what",
"is",
"the",
"plane",
"number"
] | 1,024 | 768 | "https://c8.staticflickr.com/1/60/156227243_a0b122c650_o.jpg" | "https://c4.staticflickr.com/1/60/156227243_a0b122c650_z.jpg?zz=1" | [
"105",
"105",
"105",
"105",
"105",
"105",
"105",
"105",
"105",
"105"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1665ff941d73a03f" | 76 | "what branch of the military is on the plane?" | [
"what",
"branch",
"of",
"the",
"military",
"is",
"on",
"the",
"plane"
] | 1,024 | 768 | "https://c8.staticflickr.com/1/60/156227243_a0b122c650_o.jpg" | "https://c4.staticflickr.com/1/60/156227243_a0b122c650_z.jpg?zz=1" | [
"navy",
"navy",
"navy",
"105",
"navy",
"navy",
"navy",
"eugenie & napoleon iii",
"navy",
"navy"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"171d86fc6d86d5f0" | 77 | "what is the tail number of the jet?" | [
"what",
"is",
"the",
"tail",
"number",
"of",
"the",
"jet"
] | 1,024 | 627 | "https://farm1.staticflickr.com/3901/18259467674_a4149a43ae_o.jpg" | "https://c5.staticflickr.com/4/3901/18259467674_fb79d7480d_z.jpg" | [
"13-143",
"13-143",
"13-143",
"13-143",
"13-143",
"13-143",
"13-143",
"13 143",
"13-143",
"13-143"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"188d67c561540d6b" | 78 | "what is the name of the airline?" | [
"what",
"is",
"the",
"name",
"of",
"the",
"airline"
] | 1,024 | 683 | "https://c4.staticflickr.com/3/2910/14622747350_f8c31fbba0_o.jpg" | "https://c2.staticflickr.com/3/2910/14622747350_fe432e6388_z.jpg" | [
"air canada",
"air canada",
"air canada",
"air canada",
"air canada",
"air canada",
"air canada",
"air canada",
"air canada",
"air canada"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1a192511800af150" | 79 | "what number is on the back of this plane?" | [
"what",
"number",
"is",
"on",
"the",
"back",
"of",
"this",
"plane"
] | 1,024 | 393 | "https://farm1.staticflickr.com/7576/15026526724_3bca926dca_o.jpg" | "https://c7.staticflickr.com/8/7576/15026526724_ef993b26c0_z.jpg" | [
"3",
"46100",
"3",
"3",
"3",
"3",
"3",
"3",
"46100",
"46100"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1a2e9a1c8d9432b6" | 80 | "what is the name on the plane?" | [
"what",
"is",
"the",
"name",
"on",
"the",
"plane"
] | 1,024 | 529 | "https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg" | "https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg" | [
"iberia ",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia",
"iberia"
] | [
"Land vehicle",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1a2e9a1c8d9432b6" | 81 | "what is the model number of the plane printed on the rear?" | [
"what",
"is",
"the",
"model",
"number",
"of",
"the",
"plane",
"printed",
"on",
"the",
"rear"
] | 1,024 | 529 | "https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg" | "https://farm7.staticflickr.com/11/16042048_4b71d57a58_o.jpg" | [
"sign of his misery",
"ec-hul",
"ec-hul",
"ec hul",
"ec-hul",
"echul",
"ec-hul",
"hul",
"hul",
"ec-hul"
] | [
"Land vehicle",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1b1c4a2fa6175cf0" | 82 | "what brand is the white balloon?" | [
"what",
"brand",
"is",
"the",
"white",
"balloon"
] | 1,024 | 683 | "https://farm1.staticflickr.com/3783/10262236824_d713bbede4_o.jpg" | "https://c3.staticflickr.com/4/3783/10262236824_02a1022976_z.jpg" | [
"la ,esa",
"lamesra",
"lamesa",
"lamesa",
"lamesa",
"la mesa",
"lamesarv",
"lamesa",
"lamesa",
"lamesah"
] | [
"Toy",
"Balloon",
"Vehicle",
"Aircraft"
] | "train" |
|
"1c28f57e08fc7858" | 83 | "does this plane state that it's easy?" | [
"does",
"this",
"plane",
"state",
"that",
"it",
"'",
"s",
"easy"
] | 1,024 | 620 | "https://c1.staticflickr.com/3/2828/9106837957_9136bd1b52_o.jpg" | "https://c7.staticflickr.com/3/2828/9106837957_41a172a688_z.jpg" | [
"no",
"no",
"no",
"no",
"no",
"no",
"yes",
"no, ez",
"unanswerable",
"unanswerable"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1c28f57e08fc7858" | 84 | "what is the id number on the bottom of the wing?" | [
"what",
"is",
"the",
"id",
"number",
"on",
"the",
"bottom",
"of",
"the",
"wing"
] | 1,024 | 620 | "https://c1.staticflickr.com/3/2828/9106837957_9136bd1b52_o.jpg" | "https://c7.staticflickr.com/3/2828/9106837957_41a172a688_z.jpg" | [
"n5588n",
"n5588n",
"n5588n",
"n5588n",
"n5588n",
"n5588n",
"n5588n",
"465688m",
"n5588n",
"n5588n"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1d1b9a571d21441e" | 85 | "what is the helicopter's number?" | [
"what",
"is",
"the",
"helicopter",
"'",
"s",
"number"
] | 1,024 | 646 | "https://c7.staticflickr.com/3/2105/5816184329_31a25758a6_o.jpg" | "https://c7.staticflickr.com/3/2105/5816184329_261a5522e4_z.jpg" | [
"1",
"1",
"1",
"pp jdr",
"1",
"pp-jdr",
"pp jdr",
"22",
"1",
"2007"
] | [
"Vehicle",
"Helicopter",
"Aircraft"
] | "train" |
|
"1d1b9a571d21441e" | 86 | "which website is listed on the picture?" | [
"which",
"website",
"is",
"listed",
"on",
"the",
"picture"
] | 1,024 | 646 | "https://c7.staticflickr.com/3/2105/5816184329_31a25758a6_o.jpg" | "https://c7.staticflickr.com/3/2105/5816184329_261a5522e4_z.jpg" | [
"flickr",
"flickr.com/photos/degu_andre",
"flickr.com",
"flickr.com/photos/degu_andre",
"flickr.com/photos/degu_andre",
"flickr.com",
"flickr.com/photos/degu_andre",
"flickr.com",
"flikr.com/photos/degu_andre",
"flickr.com"
] | [
"Vehicle",
"Helicopter",
"Aircraft"
] | "train" |
|
"1e4fcfbd0bb6e1e1" | 87 | "what is the slogan on the side say that life is for?" | [
"what",
"is",
"the",
"slogan",
"on",
"the",
"side",
"say",
"that",
"life",
"is",
"for"
] | 1,024 | 645 | "https://farm2.staticflickr.com/3076/3118659187_482c4be28c_o.jpg" | "https://c6.staticflickr.com/4/3076/3118659187_e21f204141_z.jpg" | [
"sharing ",
"life is for sharing",
"sharing",
"life is for sharing",
"lite is for sharing",
"sharing",
"life is for sharing",
"sharing",
"airplane",
"audite"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"1e4fcfbd0bb6e1e1" | 88 | "what brand is being advertised on the plane?" | [
"what",
"brand",
"is",
"being",
"advertised",
"on",
"the",
"plane"
] | 1,024 | 645 | "https://farm2.staticflickr.com/3076/3118659187_482c4be28c_o.jpg" | "https://c6.staticflickr.com/4/3076/3118659187_e21f204141_z.jpg" | [
"t mobile",
"t mobile",
"t mobile",
"t-mobile",
"t mobile",
"t-mobile",
"t-mobile",
"t-mobile",
"t",
"t-mobile"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"20db2e4f0602e5aa" | 89 | "what is the number on the tail wing?" | [
"what",
"is",
"the",
"number",
"on",
"the",
"tail",
"wing"
] | 1,024 | 680 | "https://c3.staticflickr.com/8/7618/17149400576_9626a83e86_o.jpg" | "https://c8.staticflickr.com/8/7618/17149400576_c5675a3e5f_z.jpg" | [
"031",
"031",
"031",
"031",
"031",
"031",
"031",
"031",
"031",
"031"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"210c80055b9f8e1a" | 90 | "what number is on the nose of this aircraft?" | [
"what",
"number",
"is",
"on",
"the",
"nose",
"of",
"this",
"aircraft"
] | 1,024 | 768 | "https://c3.staticflickr.com/3/2322/3543331109_191bfc86c8_o.jpg" | "https://c8.staticflickr.com/3/2322/3543331109_e8c689de46_z.jpg?zz=1" | [
"202",
"202",
"202",
"202",
"202",
"202",
"202",
"202",
"202",
"202"
] | [
"Vehicle",
"Clothing",
"Airplane",
"Aircraft"
] | "train" |
|
"226d623d0c70664f" | 91 | "what is the identification number on the balloon?" | [
"what",
"is",
"the",
"identification",
"number",
"on",
"the",
"balloon"
] | 680 | 1,024 | "https://c8.staticflickr.com/1/387/19416117025_35e54452b3_o.jpg" | "https://c1.staticflickr.com/1/387/19416117025_24e91b578f_z.jpg" | [
"00-b2w",
"00-bzw",
"00-bzw",
"00-bzw",
"00-bzw",
"00-9zm",
"00-bzw",
"00-bzw",
"00-bzw",
"00-bzw"
] | [
"Toy",
"Balloon",
"Vehicle",
"Clothing",
"Aircraft"
] | "train" |
|
"226d623d0c70664f" | 92 | "what is the last letter on the balloon?" | [
"what",
"is",
"the",
"last",
"letter",
"on",
"the",
"balloon"
] | 680 | 1,024 | "https://c8.staticflickr.com/1/387/19416117025_35e54452b3_o.jpg" | "https://c1.staticflickr.com/1/387/19416117025_24e91b578f_z.jpg" | [
"w",
"e",
"e",
"e",
"e",
"n",
"e",
"a",
"e",
"w"
] | [
"Toy",
"Balloon",
"Vehicle",
"Clothing",
"Aircraft"
] | "train" |
|
"229c0c1a9abfcd9e" | 93 | "what is the airline this plane belongs to?" | [
"what",
"is",
"the",
"airline",
"this",
"plane",
"belongs",
"to"
] | 1,024 | 683 | "https://c7.staticflickr.com/9/8088/8536501186_d1841cefc3_o.jpg" | "https://c4.staticflickr.com/9/8088/8536501186_f322c7e575_z.jpg" | [
"eva air",
"eva air",
"eva air",
"eva air",
"eva air",
"eva air",
"eva air",
"eva air",
"eva air",
"eva air"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"229c0c1a9abfcd9e" | 94 | "what is the 6 digit numbers on the side of the plane?" | [
"what",
"is",
"the",
"6",
"digit",
"numbers",
"on",
"the",
"side",
"of",
"the",
"plane"
] | 1,024 | 683 | "https://c7.staticflickr.com/9/8088/8536501186_d1841cefc3_o.jpg" | "https://c4.staticflickr.com/9/8088/8536501186_f322c7e575_z.jpg" | [
"777-300",
"777-300",
"777300",
"777-900",
"777-300",
"777300",
"777 300",
"777-300",
"777-300",
"777-300"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"233a9ff6b3175939" | 95 | "what year was this photograph taken?" | [
"what",
"year",
"was",
"this",
"photograph",
"taken"
] | 1,024 | 790 | "https://c2.staticflickr.com/1/193/493783526_571bf5d618_o.jpg" | "https://c4.staticflickr.com/1/193/493783526_c693ecc6a9_z.jpg?zz=1" | [
"1949",
"1949",
"1949",
"1949",
"1949",
"1949",
"1949",
"1949",
"1949",
"1949"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"233a9ff6b3175939" | 96 | "what is the model of this aircraft?" | [
"what",
"is",
"the",
"model",
"of",
"this",
"aircraft"
] | 1,024 | 790 | "https://c2.staticflickr.com/1/193/493783526_571bf5d618_o.jpg" | "https://c4.staticflickr.com/1/193/493783526_c693ecc6a9_z.jpg?zz=1" | [
"17 gc mcrexf ",
"452967",
"c-87",
"b52",
"c-87",
"17 gc mcrexf 19jan49 c-87",
"452907",
"452907",
"unanswerable",
"452967"
] | [
"Tree",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"24456069fd6847a8" | 97 | "what is the model airplane model number?" | [
"what",
"is",
"the",
"model",
"airplane",
"model",
"number"
] | 768 | 768 | "https://c1.staticflickr.com/4/3414/5707967525_a9a8db5e1b_o.jpg" | "https://c2.staticflickr.com/4/3414/5707967525_62463523ab_z.jpg" | [
"v268",
"107",
"s107",
"s107 or s1076",
"s107",
"v268",
"s107",
"s107",
"s107",
"s107"
] | [
"Person",
"Musical instrument",
"Musical keyboard",
"Vehicle",
"Helicopter",
"Clothing",
"Jeans",
"Aircraft"
] | "train" |
|
"2489dc9e42de1b36" | 98 | "what number is in red on the plane?" | [
"what",
"number",
"is",
"in",
"red",
"on",
"the",
"plane"
] | 1,024 | 683 | "https://farm7.staticflickr.com/4019/4654240354_19b1ec2f0b_o.jpg" | "https://c3.staticflickr.com/5/4019/4654240354_57c0a3a6c7_z.jpg" | [
"4858",
"1",
"4858",
"48592",
"4858",
"4858",
"unanswerable",
"48592",
"not a question",
"4859"
] | [
"Ski",
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
|
"24ad0e52d2f80d92" | 99 | "what's the number written on the plane?" | [
"what",
"'",
"s",
"the",
"number",
"written",
"on",
"the",
"plane"
] | 1,024 | 768 | "https://farm3.staticflickr.com/16/22937592_00d9b5a8b6_o.jpg" | "https://c2.staticflickr.com/1/16/22937592_00d9b5a8b6_z.jpg?zz=1" | [
"63",
"63",
"63",
"63",
"63",
"63",
"cool stuff",
"63",
"63",
"63"
] | [
"Vehicle",
"Airplane",
"Aircraft"
] | "train" |
Dataset Card for TextVQA
Dataset Summary
TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions. The TextVQA dataset contains 45,336 questions over 28,408 images from the OpenImages dataset. The dataset uses the VQA accuracy metric for evaluation.
Supported Tasks and Leaderboards
visual-question-answering: The dataset can be used for Visual Question Answering tasks where, given an image, you have to answer a question based on it. For TextVQA specifically, the questions require reading and reasoning about the scene text in the image.
Languages
The questions in the dataset are in English.
Dataset Structure
Data Instances
A typical sample contains the question in the question field, an image object in the image field, the OpenImages image id in the image_id field, and a lot of other useful metadata. Ten answers per question are contained in the answers field. For the test set, the answers field contains 10 empty strings because the answers are not available.
An example looks like this:
{'question': 'who is this copyrighted by?',
'image_id': '00685bc495504d61',
'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=384x512 at 0x276021C5EB8>,
'image_classes': ['Vehicle', 'Tower', 'Airplane', 'Aircraft'],
'flickr_original_url': 'https://farm2.staticflickr.com/5067/5620759429_4ea686e643_o.jpg',
'flickr_300k_url': 'https://c5.staticflickr.com/6/5067/5620759429_f43a649fb5_z.jpg',
'image_width': 786,
'image_height': 1024,
'answers': ['simon clancy',
'simon ciancy',
'simon clancy',
'simon clancy',
'the brand is bayard',
'simon clancy',
'simon clancy',
'simon clancy',
'simon clancy',
'simon clancy'],
'question_tokens': ['who', 'is', 'this', 'copyrighted', 'by'],
'question_id': 3,
'set_name': 'train'
},
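Because each sample carries ten free-form answers, it is often handy to collapse them into a single consensus string. The snippet below is a minimal sketch of one way to do that; the helper name consensus_answer and the lowercase/strip normalization are assumptions made for illustration, not part of the dataset or its tooling.

```python
from collections import Counter


def consensus_answer(answers):
    """Return (most_common_answer, votes) for a list of annotator answers.

    Hypothetical helper: empty strings (as found in the test split) are
    ignored, and answers are lowercased and stripped before counting.
    """
    cleaned = [a.strip().lower() for a in answers if a.strip()]
    if not cleaned:
        return None, 0
    return Counter(cleaned).most_common(1)[0]


# Answers copied from the example sample above.
example_answers = [
    "simon clancy", "simon ciancy", "simon clancy", "simon clancy",
    "the brand is bayard", "simon clancy", "simon clancy", "simon clancy",
    "simon clancy", "simon clancy",
]
answer, votes = consensus_answer(example_answers)
```

For the example sample, eight of the ten annotators agree, so the minority answers (a typo and an unrelated phrase) are simply outvoted.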
Data Fields
- question: string, the question being asked about the image.
- image_id: string, id of the image, which is the same as the OpenImages id.
- image: A PIL.Image.Image object containing the image about which the question is being asked. Note that when accessing the image column, e.g. dataset[0]["image"], the image file is automatically decoded. Decoding a large number of image files can take a significant amount of time, so it is important to query the sample index before the "image" column, i.e. dataset[0]["image"] should always be preferred over dataset["image"][0].
- image_classes: List[str], the OpenImages classes to which the image belongs.
- flickr_original_url: string, URL of the original image on Flickr.
- flickr_300k_url: string, URL of the resized, low-resolution image on Flickr.
- image_width: int, width of the original image.
- image_height: int, height of the original image.
- question_tokens: List[str], the question as a pre-tokenized list.
- answers: List[str], list of 10 human-annotated answers for the question, collected from 10 different users. For the test set, for which answers are not available, the list contains empty strings.
- question_id: int, unique id of the question.
- set_name: string, the set to which this question belongs.
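The advice to index the row before the "image" column can be illustrated with a toy model of lazy decoding. The class below is a hypothetical stand-in written for this sketch (it is not the datasets library and does not reproduce its internals); it only counts how many images get decoded under each access order.

```python
class LazyImageTable:
    """Toy stand-in for a dataset whose "image" column is decoded on access."""

    def __init__(self, encoded_images):
        self._encoded = list(encoded_images)
        self.decode_calls = 0

    def _decode(self, raw):
        # Pretend this is an expensive image decode.
        self.decode_calls += 1
        return f"decoded({raw})"

    def __getitem__(self, key):
        if isinstance(key, int):
            # Row access: decode only the requested sample.
            return {"image": self._decode(self._encoded[key])}
        if key == "image":
            # Column access: decode every image in the table.
            return [self._decode(raw) for raw in self._encoded]
        raise KeyError(key)


table = LazyImageTable(f"img{i}.jpg" for i in range(1000))

row_first = table[0]["image"]
row_first_cost = table.decode_calls

table.decode_calls = 0
column_first = table["image"][0]
column_first_cost = table.decode_calls
```

Both access orders return the same image, but in this toy model the row-first form decodes a single file while the column-first form decodes all 1000 before discarding 999 of them.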
Data Splits
There are three splits: train, validation, and test. The train and validation sets share images with the OpenImages train set and have their answers available. For test set answers, we return a list of ten empty strings. To get inference results and numbers on the test set, you need to go to the EvalAI leaderboard and upload your predictions there. Please see instructions at https://textvqa.org/challenge/.
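Since test samples carry ten empty strings instead of real answers, code that scores predictions locally may want to skip them. A minimal sketch follows, assuming samples are plain dicts shaped like the example in Data Instances; the helper name has_reference_answers is hypothetical.

```python
def has_reference_answers(sample):
    """Return True if the sample carries at least one non-empty answer."""
    return any(a.strip() for a in sample.get("answers", []))


# Hand-written samples for illustration; real rows have more fields.
train_like = {"question_id": 3, "set_name": "train",
              "answers": ["simon clancy"] * 8 + ["simon ciancy", "the brand is bayard"]}
test_like = {"question_id": 0, "set_name": "test", "answers": [""] * 10}

scorable = [s for s in (train_like, test_like) if has_reference_answers(s)]
```

Only the train-like sample survives the filter; the test-like one has to be scored through the leaderboard instead.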
Dataset Creation
Curation Rationale
From the paper:
Studies have shown that a dominant class of questions asked by visually impaired users on images of their surroundings involves reading text in the image. But today’s VQA models can not read! Our paper takes a first step towards addressing this problem. First, we introduce a new “TextVQA” dataset to facilitate progress on this important problem. Existing datasets either have a small proportion of questions about text (e.g., the VQA dataset) or are too small (e.g., the VizWiz dataset). TextVQA contains 45,336 questions on 28,408 images that require reasoning about text to answer.
Source Data
Initial Data Collection and Normalization
The initial images were sourced from the OpenImages v4 dataset. They were first filtered with automatic heuristics based on an OCR system, keeping only images in which at least some text was detected. See the annotation process section for the subsequent stages.
Who are the source language producers?
English Crowdsource Annotators
Annotations
Annotation process
After the automatic filtering of images that contain text, human annotators manually verified that the images had text. In the next stage, annotators were asked to write questions involving scene text for each image; for some images, two questions were collected whenever possible. In the final stage, ten different human annotators answered the questions written in the previous stage.
Who are the annotators?
Annotators are from one of the major data collection platforms such as AMT. Exact details are not mentioned in the paper.
Personal and Sensitive Information
The dataset has PII issues similar to those of OpenImages and can at times contain human faces, license plates, and documents. Using the provided image_classes data field is one option for filtering out some of this information.
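As a rough illustration of filtering on image_classes, the sketch below drops samples whose classes overlap a block list. The block list itself is an assumption chosen for this example ("Person" appears in the preview rows; the other two names are believed to be OpenImages class labels but should be checked against the actual class list), and the helper name drop_sensitive is hypothetical.

```python
# Assumed block list for illustration; adjust to the classes you care about.
SENSITIVE_CLASSES = frozenset({"Person", "Human face", "Vehicle registration plate"})


def drop_sensitive(samples, blocked=SENSITIVE_CLASSES):
    """Keep only samples whose image_classes share nothing with `blocked`."""
    return [s for s in samples if not blocked.intersection(s["image_classes"])]


# Two rows abbreviated from the preview above.
samples = [
    {"question_id": 66, "image_classes": ["Vehicle", "Aircraft"]},
    {"question_id": 67, "image_classes": ["Person", "Vehicle", "Clothing",
                                          "Footwear", "Airplane", "Aircraft"]},
]
kept = drop_sensitive(samples)
```

This only removes images that were labeled with a blocked class, so it reduces rather than eliminates the chance of sensitive content.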
Considerations for Using the Data
Social Impact of Dataset
The paper helped establish the importance of scene text recognition and reasoning in general-purpose machine learning applications and has led to many follow-up works, including TextCaps and TextOCR. Similar datasets have been introduced over time, either focusing on visually impaired users, such as VizWiz, or targeting the same problem as TextVQA, such as STVQA, DocVQA, and OCRVQA. Currently, most methods train on a combined dataset from TextVQA and STVQA to achieve state-of-the-art performance on both datasets.
Discussion of Biases
Question-only bias, where a model is able to answer the question without even looking at the image, was a major issue with the original VQA dataset and is discussed in the paper. Outlier bias in answers is mitigated by collecting 10 different answers, which are also taken into consideration by the evaluation metric.
Other Known Limitations
- The dataset is English-only but includes images with non-English Latin characters, so it can require some multilingual understanding.
- Performance on the dataset also depends on the quality of the OCR used, as OCR errors can directly lead to wrong answers.
- The metric used for calculating accuracy is the same as VQA accuracy. It relies on exact matching against the given answers and thus doesn't allow analyzing near-miss errors (for example, a single wrong character) introduced by OCR.
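To make the metric concrete, here is a minimal sketch of VQA-style accuracy: a prediction is compared against each leave-one-out subset of the ten human answers, scores min(1, matches / 3) on each subset, and the scores are averaged. The function name and the simple lowercase/strip normalization are assumptions for this sketch; the official evaluation applies more elaborate answer normalization, so numbers from this code may differ from leaderboard numbers.

```python
def vqa_accuracy(prediction, answers):
    """Sketch of VQA accuracy for one question (simplified normalization)."""
    pred = prediction.strip().lower()
    gts = [a.strip().lower() for a in answers]
    if not gts:
        return 0.0
    scores = []
    for i in range(len(gts)):
        # Leave out the i-th annotator and count exact matches among the rest.
        others = gts[:i] + gts[i + 1:]
        matches = sum(1 for a in others if a == pred)
        scores.append(min(1.0, matches / 3.0))
    return sum(scores) / len(scores)


# Answers for question 79 in the preview: seven "3" and three "46100".
answers_79 = ["3", "46100", "3", "3", "3", "3", "3", "3", "46100", "46100"]
majority_score = vqa_accuracy("3", answers_79)
minority_score = vqa_accuracy("46100", answers_79)
near_miss_score = vqa_accuracy("4610", answers_79)
```

The near-miss prediction scores zero even though it differs from a human answer by one character, which is the exact-match limitation described in the last bullet above.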
Additional Information
Dataset Curators
- Amanpreet Singh
- Vivek Natarajan
- Meet Shah
- Yu Jiang
- Xinlei Chen
- Dhruv Batra
- Devi Parikh
- Marcus Rohrbach
Licensing Information
CC BY 4.0
Citation Information
@inproceedings{singh2019towards,
title={Towards VQA Models That Can Read},
author={Singh, Amanpreet and Natarajan, Vivek and Shah, Meet and Jiang, Yu and Chen, Xinlei and Batra, Dhruv and Parikh, Devi and Rohrbach, Marcus},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
pages={8317-8326},
year={2019}
}
Contributions
Thanks to @apsdehal for adding this dataset.