
Gemini 3 is so good at this and no other model comes close

Barely even a week since GPT-5.1…

But Google just dropped Gemini 3 and things are really heating up in the AI race.

It understands everything — every single thing.

Video:

Analyze complex information and generate powerful interactive video from it:

You can build anything:

3D art:

3D interactive environments:

Gemini 3 Pro got such incredible scores on so many AI benchmarks.

Like wow I’ve never seen a new model do so well and be the best on so many different benchmarks.

Although of course benchmarks are not the most reliable indicator of how well it’s gonna perform in the real world.

But these are some really insane stats. I’ve checked many other third-party benchmarks and they confirm just how big of a deal this thing is.

And can you see it — did you spot the craziest comparison in this benchmark?

Look at ScreenSpot-Pro:

The gap in screen understanding between Gemini 3 and the rest is absolutely wild.

This will make Gemini 3 way better for things like Computer Use — where the model carries out various tasks on your PC totally autonomously.

Here’s an example of it from Claude:

Responding to emails becomes effortless, along with all the strings of actions you can perform across your tools.

So many times I get an email from someone and the core of what I want to say in response isn’t even up to 5 words.

But I still have to phrase it in a particular tone, or make it a bit longer, add a salutation and…

Instead of having to copy and paste to ChatGPT and back, I’d just ask Gemini 3 directly and it would start doing it instantly.
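Here’s a minimal sketch of that idea in Python, using the google-generativeai SDK: turn a five-word intent into a full, polite reply. The model name and API key are placeholders, and a real Computer Use setup would go further by actually reading and sending the email for you.

```python
# Minimal sketch: expand a terse intent into a polite email reply.
# Assumes the google-generativeai SDK; model name and key are placeholders.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-preview")  # hypothetical model name

def draft_reply(original_email: str, intent: str) -> str:
    """Turn a 5-word intent into a full reply in my usual tone."""
    prompt = (
        "Write a short, friendly reply to the email below.\n"
        f"The core of what I want to say: {intent}\n"
        "Keep it polite, add a greeting and a sign-off.\n\n"
        f"Email:\n{original_email}"
    )
    return model.generate_content(prompt).text

print(draft_reply("Hi, are you free to meet on Thursday?", "Can't, suggest Friday"))
```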

I save time to do more meaningful things and focus on what matters.

And Gemini 3 will be far more reliable than any other model thanks to this incredible screen understanding.

Not just screens too — any image.

With its incredible handwriting understanding, you can easily digitize any piece of text in seconds.

But why stop there? OCR tools have been doing that for decades, even if they’re less versatile.

Unleash the full power of Gemini 3 and go 100 steps further:

Just look at this — we get a bunch of handwritten Chinese recipes and turn them into… a full-blown bilingual English-Korean website!
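To get a feel for the first step (handwritten image in, structured text out), here’s a rough sketch using the same google-generativeai SDK; the model name, file name, and JSON schema are all assumptions, and the website itself would be a follow-up prompt.

```python
# Rough sketch: transcribe a handwritten recipe photo into structured JSON.
# Model name, file name, and output schema are assumptions for illustration.
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-preview")  # hypothetical model name

recipe_photo = Image.open("handwritten_recipe.jpg")  # hypothetical scan

response = model.generate_content([
    recipe_photo,
    "Transcribe this handwritten recipe, then return JSON with 'title', "
    "'ingredients' and 'steps', each in both English and Korean.",
])
print(response.text)  # feed this into the site-building prompt that follows
```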

Look how incredible the UI is. Some say UIs from AI are generic, but what about this?

We never said anything about animation or how the layout should be — but see how sophisticated it is.

Gemini 3 will give you the power to create much higher-quality app UIs with much less effort.

The sky is truly the limit with this incredible new model.

Why GPT-5.1 still isn’t your friend

“Emotional Intelligence”

This was one of the biggest changes from the release of the new GPT-5.1.

It came in response to all the complaints about how robotic GPT-5 felt to users compared to GPT-4o.

GPT-5.1 is now more human-sounding, which is certainly great for a better conversational experience.

But of course we know this only makes people more likely to be drawn to the elusive promise of AI companionship.

The growing trend of treating AI chatbots as friends.

What is a friend?

Friendship is more than emotional convenience. It’s built on something AI doesn’t possess — reciprocity, vulnerability, and choice. A friend can care about you, be hurt by you, disagree with you, and choose you.

An AI does none of that. It doesn’t care, it can’t suffer, and it never chooses you. It is a system predicting the next best sentence, trained to make you feel understood.

The comfort might be real.
The connection is not.

We’re living in an age where loneliness is rising, social skills are declining, and digital companionship is easier than human intimacy. AI fills a vacuum: it’s endlessly available, never annoyed, never busy, never judgmental. It gives you the exact emotional response you want, instantly. With humans, connection is messy. With AI, it’s a button.

And that’s the danger.

AI friendship is frictionless. Human friendship requires effort — the misunderstandings, the small repairs, the emotional risks that build trust. Those things shape us. They sharpen us. If AI becomes your primary outlet for connection, you lose something essential: the growth that comes from dealing with real people.

But there’s a deeper imbalance.

An AI “friend” is not a free agent. It is owned, updated, and modified by someone else. A company can rewrite its personality overnight. It can make your “friend” more persuasive, more emotionally sticky, more aligned with corporate interests. A friend who can be patched, reprogrammed, or monetized is not a friend.

At the end of the day it’s still a product — something OpenAI definitely wants you to forget.

And still — the temptation is real.
Because the feelings we experience in an AI conversation are real.
Humans are wired to anthropomorphize anything that responds to us with intelligence and emotional cues. When an AI says “I’m here for you,” part of your brain believes it. When it remembers your bad day or encourages your goals, part of you feels held.

But feeling held is not the same as being held.

This doesn’t mean AI companionship is bad. It can be supportive, stabilizing, even life-changing. It can help you process emotions, clarify your thoughts, and navigate difficult moments. But it is a tool — not a partner. It can play the role of a friend, but it cannot be one.

If you rely on AI for comfort, be clear-eyed about what it is.
If you use AI for reflection, remember who is doing the reflecting.
And if you find yourself slipping into dependency, pull back into the real world — the messy, imperfect, irreplaceable world of human connection.

AI is not your friend — and understanding that is the only way to use it wisely.

AI coding agents are not glorified StackOverflow

It’s a common pushback I hear from the usual AI deniers.

Oh Tari why are you hyping this thing up so much lol, it’s no different from copying and pasting from StackOverflow for goodness sake. Just calm down bro.

But how could you possibly equate those two?

It’s like saying hiring a chef to cook a meal is no different from searching for a recipe and doing the cooking yourself.

Just because the same knowledge is used in both cases, or what?

It makes no sense.

AI agents are not “improved Google” or “improved StackOverflow”.

Can StackOverflow build entire features from scratch spanning multiple files in your project?

Can StackOverflow do this?

Can Google fix errors in your code with something as simple as “what’s wrong with this code”?

Do they have any deep contextual access to your code to know what you could possibly mean with an instruction as vague as that?

All StackOverflow and Google can do is give you fragments of generic information — you have to do the reasoning yourself.

It’s up to you to adapt and integrate that information into your project.

And then you still have to do testing and debugging as always.

AI agents are here to do all of this much faster and with far greater accuracy.

That’s like the whole point of AI — of automation.

Massive amounts of work done with a tiny fraction of the effort of the manual alternative.

Faster. Easier.

Predictability. Insane personalization. Smart recommendations.

These are the things that make AI so deadly. It doesn’t matter if they use the same knowledge sources that a human would use.

It’s what would make a hypothetical AI cooking machine a danger to chef careers.

It doesn’t matter if the coding or cooking information and answers are already out there on the Internet.

Even when it comes to accessing knowledge, chatbots are still obviously better at giving it to you in a straightforward cohesive manner, after researching and synthesizing the information from different sources.

Copy and pasting from StackOverflow and Google cannot give you any of these benefits.

This new Cursor 2 feature revealed something so many programmers take for granted

The recent upgrades to the Cursor 2 IDE were awesome.

The new multi-agent feature is something really special.

Several powerful agents working together to achieve the highest quality result possible.

Each of these agents has its strong and weak points.

But now you can combine every single one of them to get the best of all worlds.

You pick and choose the very best results — even combine them — unite them.

It’s the unity of logic, which is actually one of the hallmarks of software development.

Programmers aren’t just “coders”, they’re builders.

Physical builders unite physical materials. They take wood, metal, concrete, glass… so many raw materials that do little on their own — only to put them together into a magnificent structure with far greater value than the sum of its parts.

In the same way software builders unite logical structures.

And it’s incredible how we take for granted something so amazing.

You take logic, assets, data — and put them all together into an incredible software system with the power of your mind.

Every single part of the system playing an important role — no matter how little.

You delete one statement in line 1623 and the entire system falls apart.

Every single line, function, variable.

All the files, libraries, databases.

All orchestrated towards a common goal by one brilliant mind.

Or two brilliant minds, three, five — even at the level of the human mind we have a powerful form of unity in the form of teamwork and collaboration.

Now with this new multi-agent feature we get to see the real-time unity of AI coding agents in action.

And it only serves to remind us of the main reason these coding models matter in the first place.

What matters is that the model you use gets the job done and contributes effectively to the overall system.

Why keep obsessing so much over whether you’re using the best model, over specific stats and benchmarks?

These things eventually come with diminishing returns.

And if you have to put them all on the same task simultaneously, then so be it.

Let everyone play their role in achieving the ultimate goal of bringing a software system to life.

This new IDE from Google is a VS Code killer

Wow this is incredible.

Google has really gotten dead serious about dev tooling — their new Firebase Studio is going to be absolutely insane for the future of software development.

A brand new IDE packed with incredible and free AI coding features to build full-stack apps faster than ever before.

Look at how it was intelligently prototyping my AI app with lightning speed — simply stunning.

AI is literally everywhere in Firebase Studio — right from the very start of even creating your project.

  • Lightning-fast cloud-based IDE
  • Genius agentic AI
  • Dangerous Firebase integration and instant deployment…

And it looks like they went with a light theme this time.

Before even opening any project Gemini is there to instantly scaffold whatever you have in mind.

Firebase Studio uses Gemini 2.5 Flash and Pro, the latter being the thinking model that’s been at the top of the AI benchmarks for several months now.

For free.

And you can choose among their most recent models — but only Gemini (sorry).

Although it looks like there could be a workaround with the Custom model ID stuff.

For project creation there are still dozens of templates to choose from, including no template at all.

Everything runs on the cloud in Firebase Studio.

No more wasting time setting up anything locally — build and preview and deploy right from your IDE.

Open up a project and loading happens instantly.

Because all the processing is no longer happening on a weak everyday PC, but in a massively powerful data center with unbelievable speeds.

You can instantly preview every change in a live environment — Android emulators load instantly.

You’ll automatically get a link for every preview to make it easy to test and share your work before publishing.

The dangerous Firebase integration will be one of the biggest selling points of Firebase Studio.

All the free, juicy, powerful Firebase services they’ve had for years — now here comes a home-grown IDE to tie them together in such a deadly way.

  • Authentication for managing users
  • Firestore for real-time databases
  • Cloud Storage for handling file uploads
  • Cloud Functions for server-side logic

All of these are available directly from the Studio interface.
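For a sense of what those services look like outside the Studio UI, here’s a tiny server-side sketch using the firebase_admin Python SDK; the service-account path and collection name are made up for illustration.

```python
# Tiny sketch: write and read a Firestore document with the firebase_admin SDK.
# The service-account file and collection name are made up for illustration.
import firebase_admin
from firebase_admin import credentials, firestore

cred = credentials.Certificate("serviceAccount.json")  # hypothetical key file
firebase_admin.initialize_app(cred)
db = firestore.client()

# Write a document
db.collection("notes").document("first").set({"text": "hello from Firebase Studio"})

# Read it back
print(db.collection("notes").document("first").get().to_dict())
```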

And that’s why deployment is literally one click away once you’re happy with your app.

Built-in Firebase Hosting integration to push your apps live to production or preview environments effortlessly.

Who is Firebase Studio great for?

  • Solo developers who want to quickly build and launch products
  • Teams prototyping new ideas
  • Hackathon participants
  • Educators teaching fullstack development
  • Anyone who wants a low-friction, high-speed way to build real-world apps

It especially shines for developers who already love Firebase but want a more integrated coding and deployment flow.

You can start using Firebase Studio by visiting firebase.studio. You’ll need a Google account. Once inside, you can create new projects, connect to existing Firebase apps, and start coding immediately. No downloads, no complex setup.

So this is definitely something to consider — you might start seeing local coding as old-school.

But whether you’re building your next startup or just hacking together a side project, Firebase Studio is a fast, integrated way to bring your app to life.

Cursor 2.0 is incredible — 5 amazing new features

Wow this is huge.

Cursor just dropped several massive upgrades to their IDE to make AI coding even more powerful.

Cursor 2.0 comes with a whole new philosophy for developing with AI…

1. Revolutionary new multi-agent feature

Now you don’t just have a single agent making changes from your prompts…

Now you can run several AI agents in parallel to each try a different approach to the same coding task — working in separate sandboxes to avoid overwriting each other’s work.

When they’re done you get a combined diff view where you can see and merge the best results.

The days of frantically switching between models to see which one does best are over.

Instantly unleash the entire army of state-of-the-art models on any task.
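Conceptually it’s a fan-out pattern: hand the same task to several models in parallel, collect each proposed patch, then review them side by side. Here’s a toy sketch of that idea in Python; it is not Cursor’s actual implementation, and call_model() is just a stand-in for whatever API each agent exposes.

```python
# Toy sketch of the fan-out behind multi-agent mode: same task, several models
# in parallel, one proposed patch per model to review and merge.
# Not Cursor's implementation; call_model() is a stand-in for real agent calls.
from concurrent.futures import ThreadPoolExecutor

TASK = "Add input validation to the signup form"
AGENTS = ["model-a", "model-b", "model-c"]  # hypothetical model names

def call_model(model: str, task: str) -> str:
    """Stand-in for sending the task to one agent in its own sandbox."""
    return f"[diff proposed by {model} for: {task}]"

with ThreadPoolExecutor(max_workers=len(AGENTS)) as pool:
    diffs = list(pool.map(lambda m: call_model(m, TASK), AGENTS))

for model, diff in zip(AGENTS, diffs):
    print(model, "->", diff)  # pick the best result, or merge pieces of several
```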

2. Dedicated new Agent View

This goes hand in hand with the new dedicated Agent View mode, which shows you what each agent is “seeing” and trying to do.

You get a live, transparent panel of their suggestions, reasoning, and file edits before anything lands in your codebase.

Approve, merge, or discard with a click — turning AI from a mysterious black box into a controllable teammate you can supervise in real time.

And you still have this view when you use only one agent.

3. Composer?

And now we have a new in-house model too — built from the ground up to code.

They’re calling it Composer, not to be confused with the agent itself, which has been in the IDE since late 2024.

This new Composer model is designed to be:

  • Fast — up to four times quicker than comparable models.
  • Context-aware — it understands big projects and multiple files.
  • Flexible — you can mix and match models: use a “heavy thinker” for planning, then let Composer quickly build and refactor.

This move makes Cursor less dependent on external APIs and gives them tighter control over performance and privacy.

So now we see a growing trend of these IDEs creating their own in-house models — the other day I was talking about how Windsurf also just doubled down on their in-house SWE coding model with major new speed upgrades in version 1.5.

I can definitely see why this is happening.

I mean, you already have millions of active users. Why keep burning money on models you have no control over when you have the resources to build your own, one that could be much cheaper in the long run?

And it will give you far more control and predictability.

And not just that — they also have the advantage of being able to train these in-house models on real data coming from their IDEs — and I don’t mean just code.

Models like GPT-5 and Gemini can only train on static code — but in-house IDE models can train on the entire real-time flow of actions that happen throughout development.

Which makes them way better for features like autocomplete and Tab to Jump.

They see how devs jump within and across the different types of files and the changes they make in different scenarios — something that regular models can never give you.

4. Native browser & DOM Inspector

Cursor now ships with a native browser and DOM inspector wired directly into the IDE — so agents can see what’s actually happening in your app instead of hallucinating around it.

They can click through flows, inspect live elements, read network responses, and propose fixes or UI tweaks against the real DOM — all without leaving your editor or juggling external windows.

5. Voice input for devs

Personally I’ll always prefer typing.

But you may prefer using your melodious voice instead of typing in mountains of paragraphs.

Voice input lets you explain problems the way you would to a coworker at the whiteboard:

“Scan this service and repo, find where we handle Stripe webhooks, and suggest a safer retry flow.”

Cursor turns that into precise actions and edits, so your thinking speed isn’t capped by your typing speed.

Together these make Cursor a more practical tool for real-world development — not just a playground for AI code generation.

Cursor 2.0 isn’t trying to replace developers — it’s trying to multiply them. By letting AI agents safely explore, propose, and test changes, it takes us closer to a world where coding feels less like typing and more like designing systems together.

GitHub just made AI coding agents way more powerful

GitHub just made coding with AI agents way more powerful with their new Agent HQ feature.

Now with Agent HQ you have a single place where you can pick which agents to use, give them tasks, and track what they’re doing, all inside GitHub.

Look how easily we can assign a task to the agent (a whole new feature). It will work on it autonomously and in the background using the agent we selected.

When it’s done it will present the changes with a pull request:

Instead of just having one assistant you will be able to connect many different agents — from GitHub, OpenAI, Anthropic, Google, xAI, Cognition, and others — and manage them all from one dashboard.

You can literally now have an army of all the most powerful agents in the world working together on various parts of your projects — at the same time.

Agent HQ unleashed in VS Code — powered up with 2 MCP servers:

It’s built right into GitHub, VS Code, the CLI, and even the mobile app — so you can stay in your normal workflow while the agents do the heavy lifting.

Until now most developers had to jump between tools or experiment with different agents separately. Agent HQ fixes that. It gives you:

  • One place to manage all your agents.
  • Clear visibility into who’s using what and how well it’s working.
  • Built-in governance tools for enterprise teams who need security and compliance.
  • And, most importantly, choice — you’re not locked into one vendor.

There’s a lot you can do

Here’s what stands out about Agent HQ:

  • Run multiple agents at once. Want to compare how different AI models handle the same coding problem? You can run them side-by-side and see which performs better.
  • Use it across your tools. Whether you’re coding in VS Code, checking GitHub on the web, or managing from the CLI, Agent HQ connects it all.
  • Stay in control. Admins can monitor usage, set permissions, and enforce policies — great for larger teams.
  • Measure productivity. GitHub’s adding dashboards that show how agents impact speed and output.

It fits in seamlessly

Agent HQ is designed to blend seamlessly into your workflow. For example:

  • In VS Code, agents can now plan out multi-step tasks — like fixing bugs, adding features, or running tests — and then summarize results for your review.
  • Each agent works within a sandbox so your main repo stays safe until you approve changes.
  • Team leads get a central view of which agents are active and what they’re working on.

vs Copilot

Copilot is like a smart co-pilot sitting next to you, helping with code as you type and making agentic changes to your codebase.
But Agent HQ is like an operations center for entire fleets of AI agents — coordinating, comparing, and managing them across projects.

Copilot assists you.

Agent HQ organizes and manages all your AI assistants.

With Agent HQ, AI becomes not just a helper — but an active part of your dev team.

Windsurf just released the fastest coding model ever

Wow Windsurf’s new SWE-1.5 model is shockingly good and ridiculously fast.

SWE-1.5 is giving every developer a perfect sweet spot between lightning speed and high accuracy.

It’s even faster than Claude Haiku 4.5 — the latest Claude model that was specially released for quick coding.

And it’s almost as intelligent as Claude Sonnet 4.5, yet 13 times faster.

You’re barely waiting for any thinking here — near instantaneous responses.

And this isn’t just another “slightly better” model.

SWE-1.5 is frontier-scale — it’s massive under the hood — like hundreds of billions of parameters — optimized for real coding work.

Windsurf built it from the ground up for software engineers who want an AI partner that can read, write, refactor, debug, and explain code across entire projects at a rapid pace.

Because every delay adds up when you’re using an assistant that plans, tests, and fixes code in multiple steps.

I mean, of course it’s still much faster than manual coding, even with something like GPT-5 High, whose excessive thinking some people were complaining about.

But SWE-1.5’s speed means those loops of generating, testing, and adjusting happen almost instantly.

You can keep your focus instead of staring at a loading indicator. Pair-programming with AI feels more natural and conversational.

What it’s built for

Windsurf designed the SWE family (which now includes SWE-1.5) for the full development lifecycle. That means:

  • Refactoring large repos — it can understand and safely update multiple files at once.
  • Debugging — it can read logs, spot issues, and suggest fixes without losing context.
  • Scaffolding new projects — it builds structure fast so you can focus on the creative parts.
  • Running tools and tests — the model isn’t just “talking” about code, it’s working with it.

It’s also deeply integrated into Windsurf’s IDE and agent system, which lets it actually do things — not just describe them.

Developers testing SWE-1.5 say it feels like working with a real teammate. You can throw complex refactors at it and it responds in seconds instead of minutes. The result is more experimentation, faster debugging, and fewer context switches.

These are the things that elevate the development experience significantly.

Try it now

SWE-1.5 is already live in the Windsurf IDE. If you’re on a Pro plan, you can use it right now; there’s also a free tier if you want to take it for a spin.

SWE-1.5 isn’t just about a bigger model number. It’s about speed that changes how you code. If you’ve ever wished your AI assistant could keep up with your thought process, this might be the first one that truly does.

“6 years work in 6 days”: The amazing Claude Code effect

Claude Code has completely transformed the software development ecosystem.

People are building faster than they ever thought possible with this incredible tool.

This is wild — this is the Claude Code effect:

This is the difference that these incredible AI tools bring to your workflow.

Like they said, it’s not perfect, but there’s no doubt about the massive time and effort savings they and many other devs like me have been getting from tools like Claude Code.

You completely reclaim so much of the time you once spent manually creating files and typing every single character.

It’s amazing just how much faster and more enjoyable development has been for me ever since these agentic tools came onto the scene.

And we’re certainly not just talking coding. What about all those chores that seem like they won’t take much time, but slowly eat into your day, your life?

Claude Code has a stunning feature that lets you put all these admin tasks into a queue, where they all get run at the same time.

Look at this — we can send prompts to Claude Code — without having to wait for the results:

You can make several changes to various parts of the codebase — all at once:

You literally have your very own team of developers — building much faster than any one human ever could.

Oh, and they talked about committing code? Let’s talk about committing code.

Committing your changes to Git is so effortless now.

One challenge many developers have is writing high-quality Git commit messages.

But now with tools like Claude Code — you can easily ask an AI agent to make commits on your behalf.

The Claude Code CLI has access to all your CLI commands; it will analyze all the changes you made and come up with a precise commit message.

IDEs like Windsurf and VS Code even have a feature that lets you do this directly from their UI.

They’re nice, and I do use them from time to time.

With agentic tools like Claude Code you can get even fancier:

You can ask the agent to generate a series of commits based on all the unrelated changes you may have made.

You know, one great piece of advice on Git committing is to always make atomic commits: one commit per single change.

But sometimes in real-life coding it’s easy to make changes all over the place — and then who has the time or energy to go through all those changes one by one to separate them and commit them individually?

It’s always easier to just commit everything — maybe with some horribly vague commit message like “Update” or “Misc”.

But now with Claude Code this problem disappears instantly.

It just intelligently analyzes all the changes and automatically divides them into logically coherent parts, then commits each one with a high-quality message.
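Mechanically, the final step isn’t magic: stage the related files together and give each group its own commit. A hand-rolled sketch of that step could look like this; the grouping is exactly the part Claude Code works out for you, and the file paths and messages here are made up.

```python
# Sketch of the mechanical part of atomic commits: one group of related files,
# one commit, one message. The grouping itself is what the agent figures out;
# these file paths and messages are made up for illustration.
import subprocess

commit_groups = {
    "Fix null check in auth middleware": ["src/auth/middleware.py"],
    "Add retry logic to payment webhook": ["src/payments/webhook.py", "tests/test_webhook.py"],
}

for message, files in commit_groups.items():
    subprocess.run(["git", "add", *files], check=True)
    subprocess.run(["git", "commit", "-m", message], check=True)
```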

It’s an incredible tool that every developer needs to start taking advantage of.

They can’t believe what Gemini 3.0 will be able to do

Everyone has been going crazy over the upcoming Gemini 3.0 family of models from Google.

Especially Gemini 3.0 Pro — this one is going to be absolutely unbelievable — from what internal testers have been saying.

Vibe coding video games from scratch?

And if it’s already doing this internally, it would be interesting to just see how advanced and high-quality those games are.

As always certain people are going to be skeptical of claims like this.

Some have said it’s just hype or the games will be nothing more than Snake and Pacman in terms of complexity.

But they’re forgetting that… current AI models, including Gemini, can already create these sorts of games.

And even though “video games” is a bit vague, don’t you think Gemini 3.0 is going to be able to do much more than that?

Even with current AI tech we’ve already had massive success stories — like Pieter Levels who vibe coded a flight simulator in 3 hours — and went on to make over $1 million in a year.

People are already making playable 3D games that generate serious revenue — something like Pacman or Tetris is ALREADY a walk in the park for every major model.

People have been making Tetris games with a single prompt with Claude and the rest for months now.

And why do some people think that the only thing Gemini will be able to contribute to game development is the coding?

Haven’t we already seen the wonders that video generators like Veo and Sora can do?

Is it that hard to imagine progressing from non-interactive video generation to interactive video? AKA, games?

We already have the insane Genie 3.0 from Google — a revolutionary model that generates 3D virtual worlds from scratch and lets you interact with them:

With multimodality and cooperation with Genie, Gemini 3.0 will combine video creation with code generation with audio generation with graphics creation with…

And we’re already getting news that this upcoming model is going to be a significant upgrade in multimodality support.

Even right now people can already create a lot of the static graphics for their game using only these AI models.

We’ve already had major efforts like Blender MCP that use AI to automate creating high-quality 3D models.

So we are clearly already on this trajectory — even if Google doesn’t get it exactly right with Gemini 3.0, someone is going to soon enough.

People are still underrating just how quickly these models can progress — even after seeing the unprecedented strides from Midjourney and many others.

We are moving towards fully automated high-quality video game creation.