Date: Wed, 10 Aug 2022 19:14:37 +0000
From: bugzilla-noreply@freebsd.org
To: ports-bugs@FreeBSD.org
Subject: [Bug 265768] [NEW PORT] textproc/py-textract: Extract text from any document
Message-ID: <bug-265768-7788@https.bugs.freebsd.org/bugzilla/>

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265768

            Bug ID: 265768
           Summary: [NEW PORT] textproc/py-textract: Extract text from any document
           Product: Ports & Packages
           Version: Latest
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: Individual Port(s)
          Assignee: ports-bugs@FreeBSD.org
          Reporter: DtxdF@riseup.net

Created attachment 235833
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=235833&action=edit
textproc-py-textract-1.6.5.patch

textract provides a single interface for extracting content embedded from
Word documents, PowerPoint presentations, PDFs and much more, which can be
used for further textual analysis and visualization.

WWW: https://github.com/deanmalmgren/textract

portlint: looks fine.
poudriere: testport is ok: with all options enabled, without any option
enabled, and with default options enabled (including groups).

Requirements:

* audio/py-pocketsphinx [1]
* textproc/python-pptx [2]
* textproc/py-extract-msg [3]

[1] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265766
[2] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265763
[3] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265765

-- 
You are receiving this mail because:
You are the assignee for the bug.