pdf2docx is a Python program and library that converts PDF files to docx.

pdf2docx can:
* Extract data from PDF with PyMuPDF, e.g. text, images and drawings
* Parse the layout with rules, e.g. sections, paragraphs, images and tables
* Generate docx with python-docx
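
A minimal sketch of driving the three steps above from Python, assuming the
`Converter` class that upstream pdf2docx documents. The helper names
`docx_path_for` and `convert_pdf` are hypothetical, introduced only for this
illustration, and the import is deferred so the module loads even where
pdf2docx is not installed.

```python
from pathlib import Path


def docx_path_for(pdf_path):
    """Return the default output path: same location, .docx suffix.

    Hypothetical helper for this sketch, not part of pdf2docx.
    """
    return Path(pdf_path).with_suffix(".docx")


def convert_pdf(pdf_path, docx_path=None, start=0, end=None):
    """Convert a PDF to docx; `start`/`end` are zero-based page indexes.

    Hypothetical wrapper around pdf2docx's Converter.
    """
    # Imported lazily: pdf2docx is a third-party package and is assumed
    # to be installed only when a conversion is actually requested.
    from pdf2docx import Converter

    target = Path(docx_path) if docx_path else docx_path_for(pdf_path)
    cv = Converter(str(pdf_path))
    try:
        # Extraction, layout parsing and docx generation all happen here.
        cv.convert(str(target), start=start, end=end)
    finally:
        cv.close()
    return target
```

Calling `convert_pdf("report.pdf")` would write `report.docx` next to the
input, provided the input file exists and pdf2docx is available.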