textproc/p5-docx2txt: create port

docx2txt is a perl based command line utility to convert Microsoft Office(Tm)
Docx documents to equivalent Text documents. Latest version supports
following features during text extraction.

*	Character conversions (" ' < & > - ... fraction and some mathematical
	symbols etc.); currency characters are converted to respective names
	like Euro.
*	Capitalisation of text blocks.
*	Center and right justification of text fitting in a line of
	(configurable) 80 columns.
*	Horizontal ruler, line breaks, paragraphs separation, tabs
*	Indicating hyperlinked text along with the hyperlink. (configurable)
*	Handling (bullet, decimal, letter, roman) lists along with (attempt at)
	indentation.

WWW: https://sourceforge.net/projects/docx2txt/

PR:		225113
Submitted by:	Piotr Kubaj <pkubaj@anongoth.pl>
Steve Wills 2019-01-05 18:05:38 +00:00
parent bde0aa65cc
commit 7ed43bddc7
Notes: svn2git 2021-03-31 03:12:20 +00:00
svn path=/head/; revision=489359
5 changed files with 51 additions and 0 deletions

@@ -1159,6 +1159,7 @@
SUBDIR += p5-YAPE-Regex-Explain
SUBDIR += p5-ack
SUBDIR += p5-dTemplate
SUBDIR += p5-docx2txt
SUBDIR += p5-libsoldout
SUBDIR += p5-libxml
SUBDIR += p5-pod2pdf

@@ -0,0 +1,28 @@
# $FreeBSD$

PORTNAME=	docx2txt
PORTVERSION=	1.4
CATEGORIES=	textproc perl5
MASTER_SITES=	SF/${PORTNAME}/${PORTNAME}/v${PORTVERSION}
PKGNAMEPREFIX=	p5-

MAINTAINER=	pkubaj@anongoth.pl
COMMENT=	Utility to convert Docx documents to equivalent Text documents

LICENSE=	GPLv3
LICENSE_FILE=	${WRKSRC}/COPYING

USES=		perl5 tar:tgz

NO_ARCH=	yes
NO_BUILD=	yes

pre-patch:
	${REINPLACE_CMD} -e 's|/etc|${PREFIX}/etc|' ${WRKSRC}/docx2txt.pl

do-install:
	${INSTALL_SCRIPT} ${WRKSRC}/docx2txt.pl ${STAGEDIR}${PREFIX}/bin
	${INSTALL_SCRIPT} ${WRKSRC}/docx2txt.sh ${STAGEDIR}${PREFIX}/bin
	${INSTALL_DATA} ${WRKSRC}/docx2txt.config ${STAGEDIR}${PREFIX}/etc/docx2txt.config.sample

.include <bsd.port.mk>

@@ -0,0 +1,3 @@
TIMESTAMP = 1515770114
SHA256 (docx2txt-1.4.tgz) = b297752910a404c1435e703d5aedb4571222bd759fa316c86ad8c8bbe58c6d1b
SIZE (docx2txt-1.4.tgz) = 31302

@@ -0,0 +1,16 @@
docx2txt is a perl based command line utility to convert Microsoft Office(Tm)
Docx documents to equivalent Text documents. Latest version supports
following features during text extraction.

*	Character conversions (" ' < & > - ... fraction and some mathematical
	symbols etc.); currency characters are converted to respective names
	like Euro.
*	Capitalisation of text blocks.
*	Center and right justification of text fitting in a line of
	(configurable) 80 columns.
*	Horizontal ruler, line breaks, paragraphs separation, tabs
*	Indicating hyperlinked text along with the hyperlink. (configurable)
*	Handling (bullet, decimal, letter, roman) lists along with (attempt at)
	indentation.

WWW: https://sourceforge.net/projects/docx2txt/

@@ -0,0 +1,3 @@
bin/docx2txt.pl
bin/docx2txt.sh
@sample etc/docx2txt.config.sample
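
The pre-patch target in the port Makefile runs `${REINPLACE_CMD} -e 's|/etc|${PREFIX}/etc|'` over docx2txt.pl so that the script looks for its system-wide configuration under the ports prefix. The following is a minimal sketch of that substitution using plain `sed`. The input line is a hypothetical stand-in for the relevant line of the script, and `PREFIX=/usr/local` is assumed here only for the demonstration.

```shell
#!/bin/sh
# Assumption: PREFIX takes its usual FreeBSD default; the port framework
# sets the real value at build time.
PREFIX=/usr/local

# Hypothetical stand-in for the line in docx2txt.pl that names the
# system configuration directory.
line='my $systemConfigDir = "/etc";'

# Same sed expression as in the Makefile. Without the "g" flag, only the
# first "/etc" on each line is rewritten.
patched=$(printf '%s\n' "$line" | sed -e "s|/etc|${PREFIX}/etc|")

printf '%s\n' "$patched"
```

Because the expression is not anchored, it rewrites the first `/etc` on every line of the file where that string occurs, including comments.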