Show HN:DocuFlow – 開源事件驅動式 AI 發票導入管道

Show HN:DocuFlow – 開源事件驅動式 AI 發票導入管道

Hacker News·

DocuFlow 是一個開源、容器化、事件驅動的管道,旨在自動化發票和財務文件的導入、OCR、解析和分析流程。

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

To see all available qualifiers, see our documentation.

DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents.

License

Uh oh!

There was an error while loading. Please reload this page.

Shashank0701-byte/docuflow

Folders and files

Latest commit

History

Repository files navigation

DocuFlow — Intelligent Invoice Ingestion Pipeline

Image

Image

Image

Image

Image

DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents. It uses a lightweight microservice pattern so heavy OCR work runs asynchronously and does not block the UI.

Quick highlights:

🏗️ Architecture

The system follows a microservices pattern and is orchestrated with Docker Compose.

If GitHub renders the diagram you will see the flow below. If it doesn't, a plain ASCII fallback is provided right after.

Plain fallback (always visible):

Features

🛠️ Tech Stack

📂 Project Structure

Use the ASCII tree below — it's intentionally plain so it displays consistently across renderers.

Getting Started

Prerequisites

Clone the repository:

Build and start the stack:

Wait until the logs show Celery worker(s) ready and the watcher reporting polling mode. Then open:

How to ingest:

Notes:

🔧 Configuration & Environment

Services are configured via environment variables in docker-compose. Typical variables:

If you'd like, I can add a .env.example file with recommended values.

Troubleshooting

"New File Detected" doesn't trigger on Windows/WSL2:

Diagram won't render on GitHub (Mermaid parse error):

Dashboard shows empty charts:

Tesseract errors in Docker:

Duplicate invoice entries:

Dashboard Overview

The Streamlit UI includes:

Roadmap

Contributing

Contributions welcome! Please open an issue or a PR.

Guidelines:

License

This project is licensed under the MIT License. See the LICENSE file for full details.

About

DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents.

Resources

License

Uh oh!

There was an error while loading. Please reload this page.

Stars

Watchers

Forks

Releases

  1

Packages

  0

Uh oh!

There was an error while loading. Please reload this page.

Languages

Footer

Footer navigation

Hacker News

相關文章

  1. Show HN:用AI聊天生成發票,告別填寫表單

    4 個月前

  2. 加入我們一同建構 LoongFlow – 認知演化式 AI 框架

    3 個月前

  3. Show HN:CIE – 開源程式碼智慧引擎,專為 AI 編碼助手打造

    3 個月前

  4. Show HN:VeridisQuo – 具備可解釋 AI 的開源深度偽造偵測器

    3 個月前

  5. Show HN:開源AI計算網絡InfiniteGPU現已支援訓練

    3 個月前