The NEU-171K dataset supports both supervised and open-vocabulary object detection (3F-OVD) protocols. Each image is annotated using a COCO-style format with split files for training, validation, and test sets.
It has 145,825 images, 676,471 bounding boxes, 719 fine-grained classes and contains the following two sub-sets.
NEU-171K-C contains cars in real-world traffic scenes.

NEU-171K-RP is a subset of the NEU-171K dataset, focusing on fine-grained object detection in the domain of retail products. It includes 56,462 high-resolution images captured under controlled warehouse-like conditions, annotated with 360,377 bounding boxes across 121 distinct product classes.
Compared to NEU-171K-C, this subset offers clearer object boundaries and visually consistent backgrounds, making it ideal for studying visual details that distinguish fine-grained categories.
The structure of the NEU-171K-RP is like the follows:
- train
- images.zip
- instances_rp_train.json
- val
- images.zip
- instances_rp_val.json
- test
- images.zip
- instances_rp_test.json
- rp_categories.csv
The structure of the NEU-171K-C is similar as that of NEU-171K-RP, but replacing rp with car from the above.
The dataset can only be used for research only. This dataset is strictly for research purposes only. If you believe that our dataset violates your privacy, please feel free to contact us.
NEU-171K contains both NEU-171K-C and NEU-171K-RP as its sub-folders in the root directory.
