R2's METR 50% time horizon | Manifold

R2's METR 50% time horizon

17

1.5kṀ8221

Dec 31

7%

<1.5h

15%

1.5h - 2h

19%

2h - 2.5h

16%

2.5h - 3h

13%

3h - 3.5h

10%

3.5h - 4h

6%

4h - 5h

4%

5h - 6h

2%

6h - 7h

1.2%

7h - 8h

1.2%

8h - 9h

1.2%

9h - 10h

1.2%

10h - 11h

1.2%

11h - 12h

1.2%

>=12h

This market will resolve to the highest 50% time horizon, as reported by METR, for any R2 model released within a month of the first R2 announcement.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

See also:

/jim/gpt-52-metr

/jim/claude-45-opuss-metr50-horizon (jim's version)

/Bayesian/claude-opus-45s-metr50-time-horizon (my version)
/Bayesian/gemini-3s-50-time-horizon-per-metr

/Bayesian/gpt5s-50-time-horizon-per-metr

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr (this market)

Technical AI Timelines

Get

1,000

to start trading!

Sort by:

I just don't think they'll release R2 anymore and will just release V4 with both a thinking and nonthinking version like most labs are doing these days

If that happens, @traders do you agree it's fair to make it about V4 instead? ie if V4 is a reasoning model, R2 would refer to V4-thinking for the purpose of this market?

DeepSeek-R1: 27 mins, released 01-20-25 (SOTA since December was 39 mins)

DeepSeek-R1-0528: 31 mins, released 4 months later (SOTA since April was 1.5 hours)

quadrupling from 31 mins to > 2 hours in another 4 months seems (very) unlikely, not betting more because of uncertainty over when (if ever) it’ll be released.

People are also trading

Gemini 3's METR 50% time horizon

Kimi K3 Thinking's METR 50% time horizon

Grok 4.20's METR 50% time horizon

Claude Opus 4.5's METR-50 time horizon

Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?

Will a model achieve a METR 50% time-horizon of 4+ hours by the end of 2025?

+6% 1d31% chance

Grok 5's METR 50% time horizon

Will the METR long-horizons have a >6 month doubling time for at least a 4 month period before 2026?

Opus 4.5's METR time horizon beats GPT-5.1's?

Best AI time horizon by August 2026, per METR?

Related questions

Gemini 3's METR 50% time horizon

Kimi K3 Thinking's METR 50% time horizon

Grok 4.20's METR 50% time horizon

Claude Opus 4.5's METR-50 time horizon

Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?

Will a model achieve a METR 50% time-horizon of 4+ hours by the end of 2025?

Grok 5's METR 50% time horizon

Will the METR long-horizons have a >6 month doubling time for at least a 4 month period before 2026?

Opus 4.5's METR time horizon beats GPT-5.1's?

Best AI time horizon by August 2026, per METR?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules