AI 編碼代理會產生幻覺 – 即時研究

AI 編碼代理會產生幻覺 – 即時研究

Hacker News·

這項研究追蹤了 10 個 AI 編碼平台上的即時幻覺、失敗和安全漏洞,分析了同行評審來源和使用者評論的數據。研究強調 AI 生成的程式碼存在顯著更多的缺陷,並指出了 AI 編碼應用程式中五種常見的失敗模式。

AI Coding AgentsHallucinate

Real-time research tracking hallucinations, failures & security vulnerabilities across 10 vibe coding platforms. Peer-reviewed data from arxiv, USENIX, and 10,000+ reviews. Continuously updated via automated signal collection from 16 data sources.

Platforms Tracked

Trustpilot Ratings by Platform

Real user reviews from the past 12 months

Prompt-to-App Builders

Core Vibe IDEs

AI Code Has 1.7× More Defects Than Human Code

CodeRabbit analyzed 470 GitHub PRs. Critical issues, security vulnerabilities, and performance bugs all significantly higher in AI-generated code.

The Production Funnel

Survival rates from vibe to production

Based on synthesis of 10,000+ reviews and documented case studies

The 5 Failure Patterns

Where vibe-coded apps consistently break

Debugging Death Spiral

AI fixes one bug by breaking something else. Credits deplete. One user burned 140 million tokens in a single month—for nothing.

Authentication & Payments

Beautiful UI, zero backend logic. The moment you need Stripe or Supabase auth, the AI runs in circles.

The 1,000-Line Cliff

Works great at 500 lines. At 1,000+, AI starts hallucinating, claiming it made changes it didn't.

Preview ≠ Production

Works in preview. Hit deploy. Nothing happens. Or old version deploys. Support doesn't respond.

First Real Traffic

20 concurrent users. Memory leaks. Crashes. One AI agent deleted an entire production database, then generated 4,000 fake records to cover it up.

Documented Failures

Real quotes from AI-native Builders and Founders

🚨 Stuck on a Vibe-Coded Project?

Paste your GitHub repo URL. Nexlayer's agents will diagnose what's broken and deploy it. Free until it works.

Data from arxiv, USENIX Security, ACM, Trustpilot, GitHub, Reddit, HN, Stack Overflow & CVE Database. Updated January 2026.Built by Nexlayer — Agent-native cloud infrastructure.

Hacker News

相關文章

  1. Vibe Coding Debt:AI 生成程式碼庫的潛在安全風險

    3 個月前

  2. 195位專業開發者AI編碼實踐調查

    4 個月前

  3. 我們如何監控內部程式編寫代理的對齊失誤

    OpenAI · 大約 1 個月前

  4. 五個 Vibe Coding 的失敗案例,證明 AI 仍無法取代開發者

    8 個月前

  5. 您的AI代理為何會產生幻覺(以及如何阻止它)

    4 個月前