The Emergence of Large Action Models (LAMs)

Recent advances in artificial intelligence have introduced the Large Action Model (LAM), a new type of foundation model that understands human intentions on computers. LAM is the cornerstone of rabbit OS: with it, rabbit OS understands what you say and gets things done.

The LAM concept is exemplified by the Rabbit R1, a gadget introduced at CES 2024, and signals a shift in AI capabilities from mere language processing to executing real-world actions based on natural-language instructions. The Rabbit R1's integration of a LAM provides an intuitive user experience, marking a significant evolution in AI technology.

See LAM in action on the Rabbit R1: [embedded demo video]

Understanding LAMs and Their Functionality

LAMs are designed to understand and execute human intentions on computers, primarily through a natural-language-centered approach.
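
To make this concrete, here is a minimal sketch, in Python, of the natural-language-centered loop a LAM implies: a request is parsed into a structured intent, which is then executed against an application. The `Intent`, `parse_intent`, and `execute` names are hypothetical illustrations, not rabbit's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class Intent:
    """A structured representation of what the user asked for."""
    action: str                  # e.g. "book_flight"
    parameters: dict = field(default_factory=dict)


def parse_intent(utterance: str) -> Intent:
    """Map a natural-language request to a structured intent.

    A real LAM would use a learned model here; this stub only
    illustrates the shape of the mapping.
    """
    if "flight" in utterance.lower():
        return Intent("book_flight", {"query": utterance})
    return Intent("unknown", {"query": utterance})


def execute(intent: Intent) -> str:
    """Carry out the intent by driving the relevant app or service."""
    if intent.action == "book_flight":
        # A real system would operate the airline's own interface
        # on the user's behalf; we just simulate the outcome.
        return f"Searching flights for: {intent.parameters['query']}"
    return "Sorry, I can't do that yet."


print(execute(parse_intent("Book me a flight to Tokyo next Friday")))
```

The point of the structured intent layer in this sketch is that understanding and acting stay decoupled: the parser can improve without touching the code that drives applications.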

Unlike LLMs, which generate and interpret text, LAMs extend that capability to acting on it, giving them the potential to revolutionize how we interact with technology:

  • LAMs learn by watching people use interfaces and can imitate these actions reliably (see the record-and-replay sketch after this list).
  • They understand how applications and services are used every day, focusing on what users want to do instead of just following a set of rules.
  • LAMs can learn to use the interface of any software application, essentially bridging the gap between understanding language and completing tasks.
  • LAMs can navigate digital environments and carry out many different tasks, such as booking flights or controlling smart home devices.
  • They point to a possible future in which AI models make traditional operating systems less central, which could change how UI/UX designers and developers work.
  • The Rabbit R1, powered by a LAM, emphasizes simple gestures and voice commands over conventional interfaces, encouraging a more natural way of interacting with technology.
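
The learn-by-demonstration point above can be pictured as record-and-replay. The sketch below is a deliberate simplification under assumed names (`UIAction`, `record_demonstration`, and `replay` are invented for illustration): a real LAM generalizes the demonstrated steps to new requests and new interfaces rather than replaying fixed actions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class UIAction:
    """One observed step in a user demonstration."""
    kind: str        # "click", "type", "scroll", ...
    target: str      # semantic label for the UI element touched
    value: str = ""  # text entered, if any


def record_demonstration() -> List[UIAction]:
    """Stand-in for watching a user complete a task once.

    A real system would capture events from the live interface;
    here we hard-code a short flight-booking demonstration.
    """
    return [
        UIAction("click", "search_box"),
        UIAction("type", "search_box", "SFO to NRT, Friday"),
        UIAction("click", "search_button"),
        UIAction("click", "cheapest_result"),
    ]


def replay(actions: List[UIAction]) -> None:
    """Imitate the demonstrated actions step by step."""
    for step in actions:
        detail = f" -> '{step.value}'" if step.value else ""
        print(f"{step.kind} on {step.target}{detail}")


replay(record_demonstration())
```

Representing each step with a semantic target label, rather than raw screen coordinates, is what leaves room for the generalization the list describes.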