Codex, OpenAI’s New Coding Agent, Wants to Be a World-Killer

Share This Post


Though artificial intelligence is taking the world by storm, it’s still pretty bad at tasks demanding a high-degree of flexibility, like writing computer code.

Earlier this year, ChatGPT maker OpenAI published a white paper taking AI to task for its lackluster performance in a coding scrum. Among other things, it found that even the most advanced AI models are “still unable to solve the majority” of coding tasks.

Later in an interview, OpenAI CEO Sam Altman said that these models are “on the precipice of being incredible at software engineering,” adding that “software engineering by the end of 2025 looks very different than software engineering at the beginning of 2025.”

It was a bold prediction without much substance to back it — if anything, generative AI like the kind Altman pedals has only gotten worse at coding as hallucination rates increase with each new iteration.

Now we know what he was playing at.

Early on Friday, OpenAI revealed a preview of Codex, the company’s stab at a specialty coding “agent” — a fluffy industry term that seems to change definitions depending on which company is trying to sell one to you.

“Codex is a cloud-based software engineering agent that can work on many tasks in parallel,” the company’s research preview reads.

The new tool will seemingly help software engineers by writing new features, debugging existing code, and answering questions about source code, among other tasks.

Contrary to ChatGPT’s everything-in-a-box model, which is geared toward the mass market, Codex has been trained to “generate code that closely mirrors human style and PR preferences.” That’s a charitable way to say “steal other people’s code” — an AI training tactic OpenAI has been sued for in the not-too-distant past, when it helped Microsoft’s Copilot go to town on open-source and copyrighted code shared on GitHub.

Thanks in large part to a technicality, OpenAI, GitHub, and Microsoft came out of that legal scuffle pretty much unscathed, giving OpenAI some convenient legal armor should it choose to go it alone with its own in-house model trained on GitHub code.

In the Codex release, OpenAI claims its coding agent operates entirely in the cloud, cut off from the internet, meaning it can’t scour the web for data like ChatGPT. Instead, OpenAI “limits the agent’s interaction solely to the code explicitly provided via GitHub repositories and pre-installed dependencies configured by the user via a setup script.”

Still, the data used to train Codex had to come from somewhere, and judging by the rash of copyright lawsuits that seem to plague the AI industry, it’s only a matter of time before we find out where.

More on OpenAI: ChatGPT Users Are Developing Bizarre Delusions



Source link

Related Posts

China slams US ‘bullying’ over new warnings on Huawei chips

Beijing condemned on Wednesday new US warnings on...

AMD unveils new Threadripper CPUs and Radeon GPUs for gamers at Computex 2025

During Computex 2025, Advanced Micro Devices held a...

NASA says long-running budget shortfalls may lead to ISS crew and research reductions

WASHINGTON — NASA says a “multi-year” budget shortfall...

Use NotebookLM to learn about I/O 2025

Google I/O 2025 was full of tons of...

Google I/O 2025: AI Mode in Search Gets Agentic Capabilities and a Shopping Experience

Google's latest artificial intelligence (AI) feature in Search,...

Gemini is now even smarter – New tools announced at Google I/O 2025

As Google I/O 2025 gets underway, today's keynote...
- Advertisement -spot_img