feature: modify UnstructuredFormatter and ExternalPDFFormatter description (#44)

* feature: add UnstructuredFormatter

* feature: add UnstructuredFormatter in db

* feature: add unstructured[docx]==0.18.15

* feature: support doc

* feature: add mineru

* feature: add external pdf extract operator by using mineru

* feature: mineru docker install bugfix

* feature: add unstructured xlsx/xls/csv/pptx/ppt

* feature: modify UnstructuredFormatter and ExternalPDFFormatter description

---------

Co-authored-by: Startalker <438747480@qq.com>
Startalker
2025-10-31 10:32:14 +08:00
committed by GitHub
parent c6958d1511
commit a600c1d793
4 changed files with 8 additions and 8 deletions

@@ -2,7 +2,7 @@
# -*- coding: utf-8 -*-
"""
-Description: External PDF text extraction
+Description: MinerU PDF text extraction
Create: 2025/10/29 17:24
"""
import json