Learn bits
Science & Tech.
Mahesh

15/05/24 12:55 PM IST

GPT-4o AI model

In News
  • OpenAI livestreamed the launch of its new flagship AI model, the GPT-4o, capable of accepting audio and visual inputs and generating output almost flawlessly.
GPT-4o
  • OpenAI ran several demos to show off the diverse abilities of GPT-4o across audio, images, and text.
  • The AI interface, based on a user’s instructions, can turn a picture of a man into a caricature, create and manipulate a 3D logo, or attach a logo to an object.
  • It can also generate meeting notes based on an audio recording, design a cartoon character, and even make a stylised movie poster with real people’s photos.
  • In promotional video clippings, GPT-4o assessed a man’s readiness for an interview and made jokes about him for being dressed too casually, thus demonstrating its visual understanding.
  • In others, it helped set up a game, assisted a child in solving a math problem, recognised real-life objects in Spanish, and even expressed sarcasm.
  • OpenAI did not shy away from praising the new model, claiming that it beat existing rivals such as Claude 3 Opus and Gemini Ultra 1.0, as well as its own GPT-4 offering, in several areas across text evaluation and vision understanding evaluations.
Challenges
  • While GPT-4o can process text, audio, and images, one noticeable omission is video generation - despite the model’s vision understanding capability.
  • So, users cannot ask GPT-4o to give them a fleshed-out movie trailer, but they can ask the model questions about their surroundings by making the AI see the user’s environment through their smartphone’s camera.
  • Furthermore, GPT-4o made some slip-ups and errors when demonstrating its abilities. For example, when converting two portraits into a crime movie-style poster, the model initially produced gibberish instead of text.
  • Though the results were later refined, the final product also had a slightly raw AI-generated feel.
  • GPT-4o comes at a crucial time for the ChatGPT-maker, which is now in competition with other Big Tech firms fine-tuning their own models or turning them into business tools.
Source- The Hindu

More Related Current Affairs View All

23 May

Analysing local environmental footprints of luxury consumption

'While climate change is a global concern, issues such as water scarcity and air pollution are often localised or regionalised. For example, excessive water use in one region may n

Read More

23 May

Amul goes global

“The Taste of India. Now in USA,” India’s largest dairy co-operative Amul, i.e. Gujarat Cooperative Milk Marketing Federation (GCMMF), declared in a newspaper adv

Read More

23 May

Animal Diplomacy

'Amid serious world politics, there is an odd yet effective diplomatic tool known as animal diplomacy.' In a recent development, Malaysia, the world’s second-largest palm

Read More

India’s First Ai-Driven Magazine Generator

Generate Your Custom Current Affairs Magazine using our AI in just 3 steps