Back to Projects
HanziFlow

HanziFlow

A comprehensive ecosystem of 6 interconnected tools designed for Mandarin Chinese study, all running locally on Apple Silicon.

Architecture

  • Writer — Character practice & dictionary. Built with Next.js, Hanzi Writer, CC-CEDICT.
  • Server — Backend API & data persistence. Built with Python, FastAPI, SQLite (WAL).
  • OCR — Screen text recognition. Built with PaddleOCR v5, Tkinter, Ollama.
  • Echo — Audio transcription pipeline. Built with mlx-whisper, yt-dlp, Ollama.
  • Talk — Real-time conversation. Built with mlx-whisper, Kokoro TTS, Sherpa-ONNX.
  • HSK Pipeline — Vocabulary ETL. Built with BeautifulSoup, SQLite. 11,545 entries processed.

Key Principles

  • 100% Local: No cloud APIs. All inference runs on-device via Ollama and MLX
  • Memory-Aware: Pipeline design for 16GB RAM constraint
  • Protocol-Based: Each tool follows a defined interface for composability