Date:      Wed, 10 Aug 2022 19:14:37 +0000
From:      bugzilla-noreply@freebsd.org
To:        ports-bugs@FreeBSD.org
Subject:   [Bug 265768] [NEW PORT] textproc/py-textract: Extract text from any document
Message-ID:  <bug-265768-7788@https.bugs.freebsd.org/bugzilla/>

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265768

            Bug ID: 265768
           Summary: [NEW PORT] textproc/py-textract: Extract text from any
                    document
           Product: Ports & Packages
           Version: Latest
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: Individual Port(s)
          Assignee: ports-bugs@FreeBSD.org
          Reporter: DtxdF@riseup.net

Created attachment 235833
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=235833&action=edit
textproc-py-textract-1.6.5.patch

textract provides a single interface for extracting the content embedded
in Word documents, PowerPoint presentations, PDFs and many other formats,
so that it can be used for further textual analysis and visualization.

WWW: https://github.com/deanmalmgren/textract

portlint: looks fine.
poudriere: testport passes with all options enabled, with no options
enabled, and with the default options enabled (including groups).

Requirements:

* audio/py-pocketsphinx [1]
* textproc/python-pptx [2]
* textproc/py-extract-msg [3]

[1] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265766
[2] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265763
[3] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265765

-- 
You are receiving this mail because:
You are the assignee for the bug.


