維基百科上的AI生成內容：一個警示故事

Hacker News·4 個月前

本文探討了透過一個旨在修復ISBN參考的工具，意外在維基百科上發現AI生成內容的過程。文章強調了識別此類內容的挑戰，以及其創作和相關反應中的人為因素。

media.ccc.de

AI-generated content in Wikipedia - a tale of caution

Mathias Schindler

I successfully failed with a literature related project and accidentally built a ChatGPT detector. Then I spoke to the people who uploaded ChatGPT generated content on Wikipedia.

It began as a standard maintenance project: I wanted to write a tool to find and fix broken ISBN references in Wikipedia. Using the built-in checksum, this seemed like a straightforward technical task. I expected to find mostly typos. But I also found texts generated by LLMs. These models are effective at creating plausible-sounding content, but (for now) they often fail to generate correct checksums for identifiers like ISBNs. This vulnerability turned my tool into an unintentional detector for this type of content. This talk is the story of that investigation. I'll show how the tool works and how it identifies this anti-knowledge. But the tech is only half the story. The other half is human. I contacted the editors who had added this undeclared AI content. I will talk about why they did it and how the Wikipedians reacted and whether "The End is Nigh" calls might be warranted.

Licensed to the public under http://creativecommons.org/licenses/by/4.0

Download

This Talk was translated into multiple languages. The files available
for download contain all languages as separate audio-tracks. Most
desktop video players allow you to choose between them.

Please look for "audio tracks" in your desktop video player.

你的個人知識庫

維基百科上的AI生成內容：一個警示故事

AI-generated content in Wikipedia - a tale of caution

Download

Embed

Share:

Tags