Quoting:
"Kreuzberg is a document intelligence library that extracts structured data from 56+ formats, including PDFs, Office docs, HTML, emails, images and many more. Built for RAG/LLM pipelines with OCR, semantic chunking, embeddings, and metadata extraction."
You can easily find it on GitHub. It is MIT-licensed, so it can be used in Russia.
Russian IT business
