OpenAI introduces GPT-4o (“o” for “omni”), its latest model, designed to extend advanced AI functionalities to all users, including those on free plans. This release marks a notable development in AI technology, aiming to make high-level AI tools more accessible and user-friendly.


Improved Reasoning – GPT-4o sets a new high-score of 88.7% on 0-shot COT MMLU (general knowledge questions):


Key Takeaways:

  • GPT-4o brings GPT-4 caliber intelligence to diverse domains (the o for omni) such as text, vision, and audio, enhancing AI accessibility.
  • GPT-4o has been trained from the ground up to treat text, audio, and visual data equally and transform it all into tokens instead of relying on turning everything into text as before, allowing for the speed increase (x2), cost decrease (50%) and 5x higher rate limits compared to previous model.
  • The model features real-time responsiveness, facilitating smoother and more natural interactions without the delays typical of earlier versions.
  • GPT-4o exhibits emotional intelligence, capable of detecting and responding to user emotions through tone adjustments and empathetic engagement.
  • For the first time, advanced features are available to free users, broadening the reach and impact of sophisticated AI tools.
  • The model supports multilingual capabilities, allowing for real-time language translation, which could revolutionize communication across different language speakers.
  • GPT-4o offers code assistance, providing insights and suggestions on coding queries without direct access to the code.
  • Azure OpenAI offers GPT-4o in preview, contradicting the usual expectations of delayed releases and showcasing their commitment to being at the forefront of AI technology.

References: