Openai launches a new O3 -mini model -how to try a free Chatgpt user as follows:

Openai

On the last day of Openai’s “SHIPMAS”, the company announced the latest model O3 and O3-MINI. This exceeded O1 with a series of benchmarks, including mathematics and science, and further exceeded O1. At the beginning, Openai CEO’s Sam Altman said that O3 was scheduled to fall at the end of January, and today the company had done a good promise.

O3-mini

Friday, Openai release The O3-Mini model, the most cost-effective model in the Openai inference series, is open to the public. Until now, the series was composed of O1 and O1-mini. Like the company’s model, this model is particularly powerful in science, mathematics and coding.

Openai O3-mini is now available for Chatgpt and API.
Pro users have unlimited access to O3-mini, and PLUS & Team users have triple with rate restrictions (VS O1-mini).
Free users under the message composer[理由]You can select a button and try O3-mini with Chatgpt.

-Openai (@openai) January 31, 2025

When O3-mini is selected, a moderate inference effort to balance speed and accuracy is used. The original O1 model still has a wider range of knowledge than O3-mini, but the main advantage of the new model is higher and higher than O1-MINI.

Benchmark performance

Comparing O3-mini’s performance with O1-MINI, expert testers have discovered that O3-mini provides a more clear and more reasonable response than O1-mini. According to the post, they prefer 56 % of the O3-mini response and observed that the major errors had decreased by 39 %.

Beyond the evaluation of human preference, some STEM benchmarks, including competitive mathematics (Aime 2024), PHD-level scientific questions (GPQA diamonds), competitive code (CodeForces), O3-mini, including medium-level inference. Obtained by default -O1 -MINI outpoint.

Competition Mathematics benchmark — Openai

Also noteworthy is that, as you can see on the above AIME 2024 and software engineering (SWE bench verification) benchmarks, high inference efforts on benchmarks approach the O1 performance, sometimes exceeding it. The O3-mini model with moderate inference efforts matched the O1 performance on the Codeforces benchmark.

Safety

OPENAI evaluated the safety of O3-mini through public release through the approved content evaluation permitted to jailbreak. The company discovered that the model greatly exceeded the GPT-4O for the evaluation. Openai posted the evaluation results below and released the O3-mini system card. Page 37 PDF This includes detailed results of the evaluation.

How to access

All Openai paid tier, including Chatgpt Plus, Team, and Pro, can access Openai O3-MINI from today. Plus and team users will tripled the rate restrictions, 150 messages per day from 50 messages per day using O1-mini. Chatgpt Enterprise access is within a week.

Also: Copilot’s powerful new “Think Deeper” function is free for all users -how it works

The O3-mini model replaces the model picker’s O1-mini to help the same task. At the time of writing, as a paid user, I still have not yet access to O3-mini, and the O1-mini option is still displayed instead.

If you don’t have a subscription, don’t worry. You can see if O3-mini is worth the hype from a free account. What a free Chatgpt user has to do is click the “reason” in the message text box or play the response. Openai CEO Sam Altman confirmed free access in A Post to X。 Until now, all inference models have been retained behind the paywall. Openai did not specify a new model restriction for free users.

Contents

O3-mini Benchmark performance Safety How to access

Openai launches a new O3 -mini model -how to try a free Chatgpt user as follows:

O3-mini

Benchmark performance

Safety

How to access

Leave a Reply Cancel reply

Follow US

Popular News

Ryan Seacrest replaces Pat Sajak in revealing first day of filming for Wheel of Fortune

Global Coronavirus Cases

Importent Links

About US

Quick Links

Categories & Tags

Subscribe US

O3-mini

Benchmark performance

Safety

How to access

You Might Also Like

Risk of staying in Windows 10 after end of support (EOS)

FDA launches AI tools for employees

Suding Game reveals high in Life2 for winter release

I tested the pixel tablet without a google app, but it’s more private than the iPad

Asobo Studio’s Next Plague Tale game is the first part of the game, arriving in 2026.

Leave a Reply Cancel reply

Follow US

Weekly Newsletter

Popular News

Ryan Seacrest replaces Pat Sajak in revealing first day of filming for Wheel of Fortune

Global Coronavirus Cases

Importent Links

About US

Quick Links

Categories & Tags

Subscribe US