This page hosts Flickr8K-CN, a bilingual extension of the popular Flickr8K set, used for evaluating image captioning in a cross-lingual setting.
- Chinese sentences written by native Chinese speakers
- Chinese sentences generated by Baidu translation
- Chinese sentences generated by Google translation
- Chinese sentences generated by human translation
- Original English sentences
- Data partition
- Image features
- 1,024-dim GoogleNet pool5 layer (read by bigfile.py)
Chinese sentences | Flickr8k-train | Flickr8k-val | Flickr8k-test |
---|---|---|---|
machine translation (google) | + | + | + |
machine translation (baidu) | + | + | + |
human translation | – | – | + |
human written | + | + | + |
Reference
Adding Chinese Captions to Images. In: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR), pp. 271–275, 2016.