Robots learn tasks from observing demonstrations in USC study

Researchers from the University of Southern California (USC) are developing a system that enables robots to autonomously learn tasks from observing demonstrations.

USC researchers are working on a system that would teach robots tasks such as setting a table or driving a car from viewing a handful of demonstrations. The work has been published in a report Demonstrations Using Signal Temporal Logic.

The report details how the system evaluates the quality of each demonstration, learning mistakes as well as successes. This allows robots to learn from only a few demonstrations rather than current methods which take over 100 demonstrations.

Furthermore, it enables robots to learn intuitively similar to the way humans learn from each other.

Lead author Aniruddh Puranic, a Ph.D. student in computer science at the USC Viterbi School of Engineering, said: “Many machine learning and reinforcement learning systems require large amounts of data and hundreds of demonstrations, you need a human to demonstrate over and over again, which is not feasible.

“Also, most people don’t have programming knowledge to explicitly state what the robot needs to do, and a human cannot possibly demonstrate everything that a robot needs to know. What if the robot encounters something it hasn’t seen before? This is a key challenge.”

Using USC research methods, an autonomous driving system would be able to learn safe driving skills

There are safety concerns that imperfections in demonstrations can lead to robots learning unsafe or undesirable actions. The research looks to address these issues with Signal Temporal Logic (STL) which evaluates the quality of demonstrations and automatically ranks them to create inherent rewards.

Co-author Stefanos Nikolaidis, a USC Viterbi assistant professor of computer science, said: “Let’s say robots learn from different types of demonstrations, it could be a hands-on demonstration, videos, or simulations, if I do something that is very unsafe, standard approaches will do one of two things: either, they will completely disregard it, or even worse, the robot will learn the wrong thing.

“In contrast, in a very intelligent way, this work uses some common-sense reasoning in the form of logic to understand which parts of the demonstration are good and which parts are not. In essence, this is exactly what also humans do.”

The report used a driving demonstration as an example. If a driver skips a stop sign this would be ranked lower by the robot than in a demonstration where a driver applies the brakes to avoid a crash. The robot will learn from this smart action, adapting to human preferences.

Nikolaidis added: “If we want robots to be good teammates and help people, first they need to learn and adapt to human preference very efficiently.”

Cookie	Duration	Description
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
CONSENT	2 years	YouTube sets this cookie via embedded YouTube videos and registers anonymous statistical data.

Cookie	Duration	Description
OAID	1 year	Cookie set to record whether the user has opted out of the collection of information by the AdsWizz Service Cookies.
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Robotics & Automation – November 2023

Robotics & Automation – March 2024

Robotics & Automation – November 2023

Robotics & Automation – July 2023

Robots learn tasks from observing demonstrations in USC study

‘World’s most advanced’ humanoid to be displayed in Scotland

US and UK join forces for AI safety partnership

Care homes could benefit from ‘world’s first’ bimanual dressing robot

CMA increases scrutiny of Big Tech investments in AI start-ups

Musk says Tesla humanoid robots could be on sale ‘by end of 2025’

Partnership forms to integrate bipedal robots into US warehouses

Japanese AMR firm pursues US expansion

Upcoming Events

Industrial & Property Logistics Conference 2024

Road User Charging Conference USA 2024

Road User Charging Conference MEA 2024

Robots learn tasks from observing demonstrations in USC study

Related Stories

Upcoming Events