The new OpenAI operator agent will change the world forever.
This is going to be a real AI agent that actually works — unlike gimmicks like AutoGPT.
Soon AI will be able to solve complex goals with lots of interconnected steps.
Completely autonomous — no continuous prompts — zero human guidance apart from dynamic input for each step.
Imagine you could just tell ChatGPT “teach me French” and that’ll be all it needs…
- Analyzing your French level with a quick quiz
- Crafting a comprehensive learning plan
- Setting phone and email reminders to help you stick to your plan…
This is basically the beginning of AGI — if it isn’t already.
And when you think of it this is already what apps like Duolingo try to do — solving complex problems.
But an AI agent will do this in far more comprehensive and personalized way — intelligently adjusting to the user’s needs and changing desires.
You can say something super vague like “plan my next holiday” and instantly your agent gets to work:
- Analyzes your calendar to know the best next holiday time
- Figures out someone you’ll love from previous conversations that stays within your budget
- Books flights and sets reservations according to your schedule
This will change everything.
Which is why they’re not the only ones working on agents — the AI race continues…
We have Google apparently working on “Project Jarvis” — an AI agent to automate web-based tasks in Chrome.
Automatically jumping from page to page and filling out forms and clicking buttons.
Maybe something like Puppeteer — a dev tool programmers use to make the browser do stuff automatically — but it isn’t hard-coded and it’s far more flexible.
Anthropic has already released their own AI agent in Claude 3.5 Sonnet — a groundbreaking “computer use” feature.
Google and Apple will probably have a major advantage over OpenAI and Anthropic though — cause of Android and iOS.
Gemini Android and Apple Intelligence could seamlessly switch between all your mobile apps for a complex chain of actions.
Since they have deep access to the OS they could even use the apps without having to open them visually.
They’ll control system settings.
You call the Apple Intelligence agent, “Send a photo of a duck to my Mac”, and it’ll generate an image of a duck, turn on Airdrop on iPhone, send the photo and turn Airdrop back off.
But the most power all these agents will have comes from the API interface — letting you build tools to plug into the agent.
Like you can create a “Spotify” tool that’ll let you play music from the agent. Or a “Google” tool to check your mails and plan events with your calendar.
So it all really looks promising — and as usual folks like Sam Altman are already promising the world with it.
AI agents may well be the future—personalized, autonomous, and powerful. They’ll revolutionize how we learn, plan, and interact. The race is on.
We may see devastating job-loss impacts in several industries — including software development…
Let’s see how it goes.