Show HN:DocuFlow – 開源事件驅動式 AI 發票導入管道
DocuFlow 是一個開源、容器化、事件驅動的管道,旨在自動化發票和財務文件的導入、OCR、解析和分析流程。
Navigation Menu
Search code, repositories, users, issues, pull requests...
Provide feedback
We read every piece of feedback, and take your input very seriously.
Saved searches
Use saved searches to filter your results more quickly
To see all available qualifiers, see our documentation.
DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents.
License
Uh oh!
There was an error while loading. Please reload this page.
Shashank0701-byte/docuflow
Folders and files
Latest commit
History
Repository files navigation
DocuFlow — Intelligent Invoice Ingestion Pipeline
DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents. It uses a lightweight microservice pattern so heavy OCR work runs asynchronously and does not block the UI.
Quick highlights:
🏗️ Architecture
The system follows a microservices pattern and is orchestrated with Docker Compose.
If GitHub renders the diagram you will see the flow below. If it doesn't, a plain ASCII fallback is provided right after.
Plain fallback (always visible):
Features
🛠️ Tech Stack
📂 Project Structure
Use the ASCII tree below — it's intentionally plain so it displays consistently across renderers.
Getting Started
Prerequisites
Clone the repository:
Build and start the stack:
Wait until the logs show Celery worker(s) ready and the watcher reporting polling mode. Then open:
How to ingest:
Notes:
🔧 Configuration & Environment
Services are configured via environment variables in docker-compose. Typical variables:
If you'd like, I can add a .env.example file with recommended values.
Troubleshooting
"New File Detected" doesn't trigger on Windows/WSL2:
Diagram won't render on GitHub (Mermaid parse error):
Dashboard shows empty charts:
Tesseract errors in Docker:
Duplicate invoice entries:
Dashboard Overview
The Streamlit UI includes:
Roadmap
Contributing
Contributions welcome! Please open an issue or a PR.
Guidelines:
License
This project is licensed under the MIT License. See the LICENSE file for full details.
About
DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents.
Resources
License
Uh oh!
There was an error while loading. Please reload this page.
Stars
Watchers
Forks
Releases
1
Packages
0
Uh oh!
There was an error while loading. Please reload this page.
Languages
Footer
Footer navigation
相關文章