> For the complete documentation index, see [llms.txt](https://qiqiqi.gitbook.io/mixed-traffic/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://qiqiqi.gitbook.io/mixed-traffic/overview/relevant-datasets.md).

# Relevant Datasets

## Selected Online Open-source Datasets

<table data-header-hidden><thead><tr><th width="157"></th><th width="188"></th><th width="170"></th><th width="257"></th></tr></thead><tbody><tr><td>Dataset</td><td><p>Data Description</p><p>(Type and Volume)</p></td><td><p>Relevant Tasks</p><p>and Case Studies</p></td><td>Data Samples Screenshot</td></tr><tr><td><a href="https://bdd-data.berkeley.edu/">Berkeley Deep Drive： BDD 100k</a></td><td><p>Image &#x26; video with annotation;</p><p>100K video clips &#x26; images,1.8TB</p></td><td><p>Perception:</p><p>Semantic segmentation;</p><p>Lane detection</p></td><td><img src="/files/ajYIoKHwlrHwFFBwK1Lp" alt=""></td></tr><tr><td><a href="https://innovation-mobility.com/en/project-providentia/a9-dataset/">TUMTraf Dataset</a></td><td>Involves 7 sensor stations equipped with more than 60 SOTA and multi-modal sensors, and covered a road network of approximately 3.5 kilometres, R0, R1, R2 three different data sets</td><td><p>Perception;</p><p>Digital Twin;</p><p>Motion Prediction;</p></td><td><img src="/files/tRjXg0akpME9YoEDVdUM" alt="" data-size="original"></td></tr><tr><td><a href="https://waymo.com/open/">Waymo Open Dataset</a></td><td><p>Motion: TFRecord format with object trajectories and corresponding 3D maps for 103,354 segments;</p><p>Perception: Lidar and Camera data, labels for 2,030 segments</p></td><td><p>Motion: Motion Prediction, Interaction, Occupancy, and Flow Prediction, Sim Agents;</p><p>Perception: Segmentation, Object Detection &#x26; Tracking, Pose Estimation</p></td><td><img src="/files/fuxGPzuk46w2ouT9cBSo" alt=""></td></tr><tr><td><a href="https://woven.toyota/en/prediction-dataset">Lyft level-5 open dataset</a></td><td>170,000 scenes around automated vehicle; 1000+ hours; <a href="https://zarr.readthedocs.io/">zarr </a>format with <a href="https://woven-planet.github.io/l5kit/">python toolkit</a></td><td>Motion Prediction</td><td><img src="/files/nPPcipt5SwCb6a85rNOs" alt=""></td></tr><tr><td><a href="https://openaccess.thecvf.com/content_cvpr_2018/html/Ramanishka_Toward_Driving_Scene_CVPR_2018_paper.html">Honda Driving Datasets</a>: <a href="https://openaccess.thecvf.com/content_cvpr_2018/html/Ramanishka_Toward_Driving_Scene_CVPR_2018_paper.html">HDD</a>; <a href="https://arxiv.org/abs/1903.01568">H3D</a>; <a href="https://arxiv.org/abs/1905.12708">HSD</a>; <a href="https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8794474&#x26;casa_token=7cV3u2F9ljQAAAAA:jKUoLX_oWokQ_6qu5sn43uepvpKSfJBz-w6Ha1XEPnCyo5TkVAO9GdUd28RLP_8av7U3ItIhMw&#x26;tag=1">HEV-I</a>; <a href="https://arxiv.org/abs/1911.06978">HAD</a>; <a href="https://arxiv.org/abs/2003.13886">TITAN</a></td><td>104 hours of videos; GPS/IMU, CAN; etc.</td><td><a href="https://openaccess.thecvf.com/content_cvpr_2018/html/Ramanishka_Toward_Driving_Scene_CVPR_2018_paper.html">HDD</a>: Learning Driver Behaviour; Causal Reasoning; <a href="https://arxiv.org/abs/1903.01568">H3D</a>: 3D Multi-Object Detection and Tracking; <a href="https://arxiv.org/abs/1905.12708">HSD</a>:Traffic Scene Classification; <a href="https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8794474&#x26;casa_token=7cV3u2F9ljQAAAAA:jKUoLX_oWokQ_6qu5sn43uepvpKSfJBz-w6Ha1XEPnCyo5TkVAO9GdUd28RLP_8av7U3ItIhMw&#x26;tag=1">HEV-I</a>: Vehicle Localization; <a href="https://arxiv.org/abs/1911.06978">HAD</a>: Human-to-Vehicle Advice for end-to-end Self-driving; <a href="https://arxiv.org/abs/2003.13886">TITAN</a>: Trajectory Forecast</td><td><img src="/files/j4qDzdU3TtCeUXeZKItz" alt=""></td></tr><tr><td><a href="https://paperswithcode.com/dataset/d2city">D^2-City</a></td><td><p>10000 video clips; 12 classes bounding box, tracking ID, class ID;</p><p>Currently unreachable</p></td><td>Perception: Object Detection &#x26; Tracking</td><td><img src="/files/4YlCyWarOru1ul3swLba" alt=""></td></tr><tr><td><a href="http://www.dbehavior.net/">DBNet</a></td><td>Video, point cloud, GPS, and driver behaviour (speed and wheel); 1000 km</td><td>Driving Policy Prediction</td><td><img src="/files/iuUs4mdexCE5qzJT5VAP" alt=""></td></tr><tr><td><a href="https://github.com/ozheng1993/UCF-SST-CitySim-Dataset">CitySim</a></td><td>Drone-Based Vehicle Trajectory, 1140-minutes of drone videos@30 FPS recorded at 12 different locations</td><td><p>VR Driving Simulation;</p><p> </p><p>Digital Twin;</p><p> </p><p> Sensor Simulation;</p><p> </p><p>Driving Behaviour Analysis;</p><p> </p><p>Safety &#x26; Crash Analysis</p></td><td><img src="/files/8E8CqGy1zPaL4FotMnuW" alt=""></td></tr><tr><td><a href="https://www.highd-dataset.com/">highD</a>, <a href="https://www.ind-dataset.com/">inD</a>, <a href="https://www.round-dataset.com/">rounD</a>, <a href="https://www.exid-dataset.com/">exiD</a></td><td>Drone-based collection; 110500 vehicles; 147 hours; CSV; (Highway, Interaction, Roundabout)</td><td><p>Behaviour Extraction &#x26; Analysis; Intention / Behaviour / Motion Prediction;</p><p>Imitation Learning;</p></td><td><img src="/files/gxqikd8R2w2PmPew8PGo" alt=""></td></tr><tr><td><a href="https://www.cvlibs.net/datasets/kitti/">KITTI</a></td><td>2 grayscale cameras, 2 color cameras, 4 Edmund optics lenses, 1 3D laser scanner (10 HZ); 6 hours; 50 scenes, 180 GB</td><td><p> </p><p> </p><p>Perception: Object Detection &#x26; Tracking; Semantic and Instance Segmentation; Road/Lane Detection</p><p> </p><p> </p></td><td><img src="/files/nInfyqBYbkXeuM9epMMq" alt=""></td></tr><tr><td><a href="https://www.nuscenes.org/nuscenes">nuScenes</a></td><td>1000 driving scenes; 23 object classes annotated with 3D bounding boxes at 2Hz; 1.4M camera images, 390k LIDAR sweeps, 1.4M RADAR sweeps, and 1.4M object bounding boxes in 40k keyframes</td><td>Perception: 3D Detection and Tracking; Prediction</td><td><img src="/files/KiMnvFvCzfU4HI8803he" alt=""></td></tr><tr><td><a href="https://www.nuscenes.org/nuplan">nuPlan</a></td><td>1200h (Boston, Pittsburgh, Las Vegas and Singapore) + 838 (Las Vegas). 2D high definition maps. The states of all traffic lights are estimated. <a href="https://github.com/motional/nuplan-devkit">Python Toolkit</a> is provided.</td><td>Motion Planning, Motion Prediction</td><td><img src="/files/CbdkKnmZ13lO1rOu5rpb" alt="" data-size="original"></td></tr><tr><td><a href="https://www.argoverse.org/index.html">Argoverse 1 &#x26; 2</a></td><td><p>1: 3D Tracking Dataset with 113 3D annotated scenes;</p><p>2: Sensor Dataset with 1,000 3D annotated scenarios (lidar, ring camera, and stereo sensor data), Lidar Dataset with 20,000 unlabeled scenarios</p></td><td><p>Perception: 3D Tracking;</p><p> </p><p>Motion Forecasting</p></td><td><img src="/files/6yFRd12W6QN9Tg3Fv4aY" alt=""></td></tr><tr><td><a href="https://xingangpan.github.io/projects/CULane.html">CULane</a></td><td><p>Image (video) with annotation;</p><p>133K images</p></td><td><p>Perception:</p><p>Semantic segmentation;</p><p>Lane detection</p></td><td><img src="/files/Se6zrgx6wlgm7t9FHpw0" alt=""></td></tr><tr><td><a href="http://apolloscape.auto/scene.html">ApolloScape Baidu Inc.</a></td><td><p>Video with semantic annotation;</p><p>>140K images (video frames)</p></td><td><p>Perception:</p><p>Semantic segmentation;</p><p> </p><p>Object &#x26; Lane detection</p></td><td><img src="/files/bjxZucuGzM9mMVS65OuB" alt=""></td></tr><tr><td><a href="https://github.com/TuSimple/tusimple-benchmark/wiki">TuSimple</a></td><td><p>Image &#x26; video with annotation;</p><p>Two image sets：7K and 5K</p></td><td><p> </p><p> </p><p> </p><p>Perception:</p><p>Semantic segmentation;</p><p>Lane detection</p><p> </p><p> </p><p> </p></td><td><img src="/files/pYDAoqSdbpMSfT5LLgG1" alt=""></td></tr><tr><td><a href="https://sites.google.com/view/multispectral/">KAIST Multi-Spectral</a></td><td><p>Video, LiDAR, GPS;</p><p>10 videos</p></td><td><p>Perception:</p><p>Semantic segmentation;</p><p>Lane detection</p></td><td><img src="/files/4PDObKNWFX6eiBD7KKDs" alt=""></td></tr><tr><td><a href="https://sites.google.com/view/complex-urban-dataset">KAIST Urban Dataset</a></td><td><p>LiDAR and stereo images with various position sensors targeting a highly complex urban environment;</p><p>tar.gz</p></td><td>SLAM; Odometry</td><td><img src="/files/hKqosVpzQexJbRnnDJ5K" alt=""></td></tr><tr><td><a href="http://www.vision.ee.ethz.ch/~timofter/traffic_signs/">Belgium Traffic Sign Dataset</a></td><td>Image (Traffic sign) with annotation</td><td>Perception: Object Detection</td><td><img src="/files/iMcMaZvGXRzXT0HF8Tqi" alt=""></td></tr><tr><td><a href="http://mi.eng.cam.ac.uk/research/projects/VideoRec/CamVid/">CamVid</a></td><td>700+ images; 10+ minutes of high quality 30Hz footage with corresponding semantically labeled images at 1Hz and in part, 15Hz</td><td>Perception: Segmentation &#x26; Recognition</td><td><img src="/files/EuUepydvXAHXOp5fwVr7" alt=""></td></tr><tr><td><a href="http://data.nvision2.eecs.yorku.ca/JAAD_dataset/">JAAD York University</a></td><td><p>Video with  annotation (bounding box, behavioral label);</p><p>347 videos, 170GB</p></td><td><p>Perception:</p><p>Object Detection;</p><p> </p><p>Behaviour Analysis</p></td><td><img src="/files/6T9gWzXcaDWVU1hhZxlY" alt=""></td></tr><tr><td><a href="http://www.robesafe.uah.es/personal/eduardo.romera/uah-driveset/">UAH University of Alcalá</a></td><td><p>Video with behavioral label, GPS, vehicle data;</p><p>35 videos</p></td><td>Behavior analysis</td><td><img src="/files/xnyBhs1PdU1iPRjXVYpV" alt=""></td></tr><tr><td><a href="https://github.com/udacity/self-driving-car">Udacity self-driving-car</a></td><td><p>Video, LiDAR, GPS, vehicle with annotation (bounding box);</p><p> 300GB</p></td><td><p>Perception:</p><p>Object detection, Object tracking:</p><p> </p><p>End2End learning;</p><p>Imitation learning</p></td><td><img src="/files/cV6sK1Wa4iJOZYKBgG6n" alt=""></td></tr><tr><td><a href="http://cvrr.ucsd.edu/LISA/datasets.html">LISA: Laboratory for Intelligent &#x26; Safe Automobiles</a></td><td><p>Video (image) with annotation (vehicle and traffic sign);</p><p>3 (vehicle) + several (traffic sign) videos</p></td><td>Perception: Object Detection;</td><td><img src="/files/e5lDCpANWQSml51XvZaA" alt=""></td></tr><tr><td><a href="https://agelab.mit.edu/driveseg">MIT DriveSeg: Dynamic Driving Scene Segmentation</a></td><td><p>Video (image) with annotation;</p><p>5,000 (manual) + 20,100 (semi-auto) frames</p></td><td><p>Perception:</p><p>Object Detection, Semantic Segmentation;</p><p> </p><p>Imitation learning</p></td><td><img src="/files/klAt1fZhR8WhnMmRNtLu" alt=""></td></tr><tr><td><a href="https://www.a2d2.audi/a2d2/en.html">Audi Autonomous Driving Dataset (A2D2)</a></td><td><p>Image(video), LiDAR, with Semantic and Point cloud Segmentation, 3D bounding;</p><p>41,280 (image) +</p><p> 12,499 (3D) + 390,000 (unlabeled sensor) frames</p></td><td><p>Perception:</p><p>Object Detection, Object Tracking;</p><p> </p><p>End2End Learning;</p><p>Imitation Learning</p></td><td><img src="/files/yyQkeIZyvAdTuB59jt9l" alt=""></td></tr><tr><td><a href="https://www.mapillary.com/dataset/vistas?pKey=1697734990430617">Mapillary Vistas</a></td><td><p>25,000 high-resolution images;</p><p>124 semantic object categories;</p><p>100 instance-annotated categories;</p><p>Global reach, covering 6 continents</p></td><td><p>Perception:</p><p>Street-level Instance Segmentation</p></td><td><img src="/files/koIo0x9gx5JSiIR3qEOo" alt=""></td></tr><tr><td><a href="http://cadcd.uwaterloo.ca/">CADC: Canadian Adverse Driving Conditions Dataset</a></td><td><p>56,000 camera images; 7,000 LiDAR sweeps; 75 scenes of 50-100 frames each 10 annotation classes;</p><p>Full sensor suite: 1 LiDAR, 8 Cameras, Post-processed GPS/IMU;</p><p>Adverse weather conditions (snow)</p></td><td><p>Perception:</p><p>(3D) Object Detection, Object Tracking;</p><p> </p><p>Trajectory Prediction</p></td><td><img src="/files/p73KxtkeihzjUW0NCKoC" alt=""></td></tr><tr><td><a href="https://registry.opendata.aws/dc-lidar/">Lidar Data of Washington DC</a></td><td>LiDAR point cloud data;  LAS, XML, SHP</td><td><p>Perception:</p><p>(3D) Object Detection</p></td><td><img src="/files/kPWjaBfsrCj5dBbyHG1r" alt=""></td></tr><tr><td><a href="http://robotcar-dataset.robots.ox.ac.uk/">Oxford RobotCar</a></td><td>1 year, 1000 km; 20 million images along with LIDAR, GPS, and INS ground truth</td><td><p>Perception:</p><p>Object Detection, Object Tracking;</p><p> </p><p>Dense Reconstruction;</p><p> </p><p>Localization</p></td><td><img src="/files/0pdEjYO0xKnmo3P2YkSa" alt=""></td></tr><tr><td><a href="https://mobility-lab.seas.ucla.edu/v2v4real/">V2V4Real</a></td><td>Two vehicle cooperation simultaneously in the same location, 410 km of the driving area, 20K LiDAR, 40K RGB, and 240K annotated 3D bounding boxes across 5 vehicle classes</td><td><p>Perception: Vehicle-to-Vehicle Cooperative Perception; (3D) Object Detection, Tracking, Prediction, Localization; </p><p></p><p>Sim2Real Transfer Learning</p></td><td><img src="/files/2zbdrmxdaTBH31yP8PqT" alt=""></td></tr><tr><td><a href="https://catalog.data.gov/dataset/safety-pilot-model-deployment-data">Safety Pilot Model Deployment Data</a></td><td>Basic safety messages (BSM), vehicle trajectories, and various driver-vehicle interaction data; CSV format</td><td><p>Interactive Behaviour Extraction &#x26; Analysis;</p><p>Safety Analysis;</p><p>Driving Anomaly Detection</p></td><td><img src="/files/XMtm8s0s1AsG9B4PVX8s" alt=""></td></tr><tr><td><a href="http://interaction-dataset.com/">INTERACTION</a></td><td><p>Roundabout: 10479 trajectories, 365 mins; Unsignalized Intersection: 14867 trajectories, 433 mins; Lane change: 10933 trajectories,</p><p>133 mins; Signalized intersection: 3775 trajectories, 60 mins; High definition maps in lanelet2 format</p></td><td><p>Intention/Behaviour/Motion Prediction;</p><p> </p><p>Imitation Learning;</p><p>Reinforcement Learning;</p><p> </p><p>Interactive Behaviour Extraction &#x26; Analysis</p></td><td><img src="/files/L7BZSGhUVxA9Bklu4bWY" alt=""></td></tr></tbody></table>

&#x20;


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://qiqiqi.gitbook.io/mixed-traffic/overview/relevant-datasets.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
