Ask HN: How is AI progress measured?

Hacker News

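This Hacker News post asks the community for measures, benchmarks, and methodologies used to gauge AI progress, while acknowledging that existing approaches are inherently limited and contested.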

I’m aware that all of these measures have limitations and that many are controversial or imperfect by design. I’m not assuming they’re “good” or that they cleanly map to real-world capability.

I’d love to hear:

  • What measures, benchmarks, or methodologies you think belong on this list

  • What you see as their key strengths and failure modes

  • How (or whether) you personally use them to interpret AI progress

My goal here is discovery and understanding, not to defend or attack any particular framework.
