ArchiTXT Documentation
What is ArchiTXT?
ArchiTXT is an open-source Python library that transforms unstructured text into structured, searchable, AI-ready data, enabling automated database generation and seamless integration. It ensures transparency, interoperability, and trust by offering a fully auditable, fair, and unbiased data modeling process.
Key Features
With ArchiTXT, you can:
Discover the best way to organise your text data in a database.
Automatically produce a database from a collection of texts.
Make it easier and quicker to store and search unstructured text.
Turn raw text into machine-learning-ready data.
Need help?
Check out the Getting Started guide, or take a look at the Examples section to see ArchiTXT in action.
Explore the Documentation
Installation
Getting Started
Examples
Integrations
Fundamentals
API Reference