GSoC 2024 Project Recap

End-to-End Autonomous Vehicle Driving Based on Text-Based Instructions

Mentors: Sergio Paniego Blanco, Apoorv Garg, Nikhil Paliwal, David Pérez, and Meiqi Zhao

Summary

The 2024 GSoC project focused on the development of an end-to-end system in CARLA, built upon multimodal large models and a modified PilotNet architecture. Specifically, the agent component translates user inputs provided as natural language commands into structured control signals, including target distance and high-level control (HLC). These signals are processed by the control model, which uses distance-aware metrics to guide vehicle behavior. Furthermore, a Streamlit-based web app was developed to streamline interaction and training by providing an interface for generating and training user commands with LLMs. Finally, the entire system was tested in CARLA, where the generated control signals were applied to a simulated vehicle.

I would like to thank Google Summer of Code (GSoC), JdeRobot, and everyone who supported me throughout this project for the opportunity to contribute to the open-source community. I am especially grateful to my mentors for their guidance, particularly Sergio Paniego Blanco, whose insights and guidance were instrumental to my progress.

Weekly Blog

Week 21

We concluded the model setup by incorporating all the components into one model to test and evaluate.

Week 20

In Week 20, we improved CARLA’s vehicle control system by integrating distance-aware metrics. This update required refining data collection to track new metrics. A proof of concept showed successful action-switching based on distance, with plans to analyze datasets using HDF5 and scale up model training.

Week 19

During Week 19, we parsed CARLA’s Town02 map in OpenDRIVE format, generating structured test cases with start/end points.

Week 18

In Week 18, we introduced a distance-aware control method in CARLA to refine autonomous actions based on distance traveled, developed prompts for distance-specific commands, and validated real-time distance measurement in Town01.

Week 17

In this week, we explored integrating Visual Question Answering with reinforcement learning to enhance autonomous driving.

Weeks 15-16

In Week 15 and 16, we explored ideas to enhance autonomous vehicle capabilities through natural language instructions, particularly for complex, multi-turn interactions. Key efforts included compiling a list of potential research directions, assessing technical feasibility.

Week 14

In Week 14, we developed an "Ideas List" to explore potential research paths and merged new app updates into the main branch, including model testing in CARLA and a Streamlit web app. Documentation was refreshed, and recent progress was shared on social media.

Weeks 12-13

For the Weeks of 12 and 13, we focused on improving model evaluation and setting future research goals. Key tasks included enhancing model accuracy, addressing deployment issues, and preparing a social media update with a refined demo video.

Weeks 10-11

In Week 10 and 11, we focused on preparing for the mid-term evaluation, improving dataset generation, and building a Streamlit app to streamline model training and analysis. Key challenges included managing model storage and refining evaluation techniques.

GSoC 2024 Project Recap

End-to-End Autonomous Vehicle Driving Based on Text-Based Instructions

Summary

Weekly Blog

Week 21

Week 20

Week 19

Week 18

Week 17

Weeks 15-16

Week 14

Weeks 12-13

Weeks 10-11

Week 9

Week 8

Week 7

Week 6

Weeks 3-5

Week 2

Week 1

Community Bonding Week 2

Community Bonding Week 1

Pull Requests