TheThinkingMachine
Menu
  • Home
  • Academic
    • The Maths of AI – An introduction
    • Artificial Intelligence -a MIT Short course
  • Cyber-Defence
  • SPECTER
Menu

Breakthrough means Cleverer Robots

Posted on July 8, 2024July 8, 2024 by Webmaster

Northwestern University engineers have developed a new artificial intelligence (AI) algorithm designed specifically for smart robotics helping robots rapidly and reliably learn complex skills, called Maximum Diffusion Reinforcement Learning (MaxDiff RL). This new algorithm encourages robots to randomly explore their environments to gain as much experience as possible and by using high-quality simulated exploration data, robots demonstrated faster, more efficient learning, improving their reliability and performance and those robots using MaxDiff RL consistently outperformed other state-of-the-art models. (see research  in the journal Nature Machine Intelligence).

The new algorithm works so well that in some tasks, robots were able to successfully performed tasks in a single attempt. “……Other AI frameworks can be somewhat unreliable, and sometimes robots will totally nail a task, but, other times, they will fail completely. With this new framework, as long as the robot is capable of solving the task at all, robots do exactly what they’ve been asked to do, making its easier to interpret robot successes and failures, which is crucial.”

Training of machine-learning algorithm requires huge quantities of filtered and curated data, and AI uses this to train until they reach optimal results, but this doesn’t work well for robots because robots typically need to collect data by themselves and traditional algorithms are not compatible because disembodied systems can take advantage of a world where physical laws do not apply and as AI failures have no consequences, but in robotics, one failure could be catastrophic. By learning through self-curated random experiences, using MaxDiff RL, robots acquire necessary skills to accomplish useful tasks, but the most impressive element is that robots using the MaxDiff RL method often succeeded at correctly performing a task in a single attempt, even when they started with no knowledge.

As MaxDiff RL is a general algorithm, it can be used for a variety of applications, paving the way for reliable decision-making in smart robotics, not only for robotic vehicles that move around, but for stationary robots learning too do complex local tasks.

Category: UK

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • The Tesla trolley Problem – success
  • The Greens were wrong! It is Microbes not Fossil Fuels!
  • The Doctor has already seen you!
  • AI assisted North Korean cyber-criminals being hired in US, UK, Europe and Australia
  • AI learns to teach and improve AI

Recent Comments

No comments to show.

Recent Comments

    Tags

    Academic Papers AI Tools Escalating threat to democracy Regulation Techsistential Risk Work

    Archives

    • November 2024
    • October 2024
    • September 2024
    • July 2024
    • June 2024
    • April 2024
    • November 2023
    • July 2023
    • June 2023
    • May 2023
    • April 2023
    • March 2023
    • September 2022
    • April 2016

    Categories

    • ACADEMIC
    • AI Books
    • Asia
    • CASELAW
    • ETHICS
    • European
    • LEGISLATIVE
    • NEWS
    • RISK
    • TECHNOLOGY
    • UK
    • US
    © 2026 TheThinkingMachine | Powered by Minimalist Blog WordPress Theme