Gemini 3's METR 50% time horizon
Dec 31
<1.5h: 1%
1.5h - 2h: 3%
2h - 2.5h: 8%
2.5h - 3h: 10%
3h - 3.5h: 29%
3.5h - 4h: 30%
4h - 5h: 10%
5h - 6h: 6%
6h - 7h: 1.1%
7h - 8h: 0.6%
8h - 9h: 0.3%
9h - 10h: 0.1%
10h - 11h: 0.1%
11h - 12h: 0.1%
>=12h: 0.1%

This market will resolve to the highest 50% time horizon, as reported by METR, for any Gemini 3 model released within a month of the first Gemini 3 announcement.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.
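As a rough illustration of the definition above, here is a minimal Python sketch. It assumes success probability is modeled as a logistic function of the base-2 log of human task time, which is my reading of METR's approach and not their exact code. The function names and the coefficients are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import math


def success_probability(task_minutes, intercept, slope):
    """Modeled chance that the AI completes a task of the given human length.

    Assumed form: sigmoid(intercept + slope * log2(task_minutes)).
    """
    z = intercept + slope * math.log2(task_minutes)
    return 1.0 / (1.0 + math.exp(-z))


def fifty_percent_horizon(intercept, slope):
    """Task length in minutes at which the modeled success probability is 0.5.

    The sigmoid equals 0.5 when its argument is 0, so we solve
    intercept + slope * log2(t) = 0 for t.
    """
    if slope >= 0:
        raise ValueError("slope should be negative: longer tasks are harder")
    return 2.0 ** (-intercept / slope)


# Hypothetical coefficients, not taken from any METR report.
horizon = fifty_percent_horizon(intercept=6.0, slope=-1.0)
print(f"50% horizon: {horizon:g} minutes")
```

With these made-up coefficients the horizon is 64 minutes, and plugging that length back into `success_probability` returns 0.5.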

Left bounds inclusive, right bounds exclusive.
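To make the bucket convention concrete, here is a small Python sketch that maps a reported horizon in hours to one of the market's answers. The bucket edges come from the answer list. The function names are hypothetical, the input is assumed to be already filtered to eligible Gemini 3 models, and the "N/A" fallback is an assumption based on the creator's remark in the comments, not on the description itself.

```python
import bisect

# Bucket edges in hours, taken from the market's answer list.
EDGES = [1.5, 2, 2.5, 3, 3.5, 4, 5, 6, 7, 8, 9, 10, 11, 12]


def bucket_label(hours):
    """Return the answer bucket for a horizon, left-inclusive and right-exclusive."""
    if hours < EDGES[0]:
        return "<1.5h"
    if hours >= EDGES[-1]:
        return ">=12h"
    # bisect_right gives the index of the first edge strictly greater than
    # `hours`, so a value exactly on an edge falls into the bucket it opens.
    i = bisect.bisect_right(EDGES, hours)
    return f"{EDGES[i - 1]:g}h - {EDGES[i]:g}h"


def resolve(horizons_hours):
    """Bucket of the highest reported horizon among eligible models.

    Assumption: an empty list (no eligible model evaluated) resolves N/A.
    """
    if not horizons_hours:
        return "N/A"
    return bucket_label(max(horizons_hours))


print(bucket_label(3.5))    # exactly on an edge: lands in "3.5h - 4h"
print(resolve([2.9, 3.2]))  # highest horizon wins: "3h - 3.5h"
```

A horizon of exactly 3.5 hours therefore counts for "3.5h - 4h", not "3h - 3.5h".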

See also:

/jim/gpt-52-metr

/jim/claude-45-opuss-metr50-horizon (jim's version)

/Bayesian/claude-opus-45s-metr50-time-horizon (my version)

/Bayesian/gemini-3s-50-time-horizon-per-metr (this market)

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr


@Bayesian Hello, why did the market get closed?

why did it get closed??? there is no answer wtf

@Amonium bc the close date was set too soon. fixed

bought Ṁ20 YES

@Bayesian Thank you.

Why are we all on hold?!

How does this resolve if METR doesn't evaluate any Gemini 3 model which is released within a month?

@jim I think the “within a month” thing means any model of Gemini’s released within a month of the first announcement, not METR’s analysis

@bens yes, but it's not guaranteed that any Gemini models which meet this condition will be evaluated by METR.

opened a Ṁ25,000 NO at 1.0% order

@jim i'll bet they will

but if they don't, then obviously it would resolve to <1.5h. jk, it would resolve N/A

oh no, they probably won't. that's devastating. i forgot they waited for general access before testing gemini 2.5 pro

sold Ṁ176 NO
opened a Ṁ7 YES at 28% order

@Bayesian what do you mean by general access?

@MaxLennartson The currently available model is gemini 3 pro preview. General access is when they remove all modifiers and actually call the model gemini 3 pro in the api and such.

@Bayesian It looks like they are calling it Gemini 3 pro.

@MaxLennartson They’re calling it that to customers to keep it simple, but to devs they’re calling it gemini 3 pro preview

@Bayesian How long did it take before Gemini 2.5 became general access?

@MaxLennartson around 2-3 months iirc

@Bayesian yeah 2 months from 2.5 preview (but there was 2.5 experimental before that)

bought Ṁ10 YES

@Bayesian Do you think that METR will evaluate the AI models that have been released recently, including Gemini 3?

@MaxLennartson not gemini 3, probably opus 4.5 though

@Bayesian Well I would assume that they are probably waiting for Gemini 3 to become general access.
