ArchiTXT Documentation#

PyPI - Project Status PyPI - Latest Version PyPI - Supported Python Versions Software Heritage - ArchiTXT Source Code Archive

What is ArchiTXT ?#

ArchiTXT is an open source Python library that transforms unstructured text into structured, searchable, and AI-ready data, enabling automated database generation and seamless integration. It ensures transparency, interoperability, and trust by offering a fully auditable, fair, and unbiased data modeling process.

Key Features#

With ArchiTXT, you can:

  • Discover the best way to organise your text data in a database.

  • Automatically produce a database from a collection of texts.

  • Make it easier and quicker to store and search unstructured text.

  • Turn raw text into machine-learning-ready data.

Need help?

Check out the Getting Started guide or take a look at the Usage examples section to see ArchiTXT in action.

Explore the Documentation#

Installation

Installation

Getting Started

Getting Started

Examples

Usage examples

Integrations

Integrations

Fundamentals

Fundamentals

API Reference

architxt