This major AI company came to iMerit to implement a vision-language-action model to improve model explainability, decision-making transparency, and overall safety.

We needed curated data from experts who understand the rules of the road as well as what our models need to perform to make our demo a success.”

– Head of Computer Vision

 

Problem 

To improve model performance on US roads, this company needed a unique dataset with detailed information around human driving styles, rules of the road, and how to interact verbally with the passenger.

Solution

iMerit autonomous vehicle domain experts assessed real and synthetic driving scenarios and conditions to classify different features of each including traffic signs, brake lights, and crosswalks. Environmental abnormalities such as construction zones, collisions, and pedestrians out of place were classified and used to train the VLM to ask for guidance from vehicle operators when challenged.

Results

After training the model with iMerit datasets, the client found iMerit’s classifications were 95% accurate. iMerit also successfully improved time-per-task by 50%. The VLM was able to provide more clarity of the vehicle’s actions to the vehicle operator.

 

 

BOTTOM LINE IMPACT

50%

Time-Per-Task Improvement

95%

Classification Accuracy

50%

Improved Efficiency