From nobody Wed Aug 10 19:14:37 2022 X-Original-To: ports-bugs@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4M304F3BZSz4Z4QH for ; Wed, 10 Aug 2022 19:14:37 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from mxrelay.nyi.freebsd.org (mxrelay.nyi.freebsd.org [IPv6:2610:1c1:1:606c::19:3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "mxrelay.nyi.freebsd.org", Issuer "R3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4M304F0vTPz3d7C for ; Wed, 10 Aug 2022 19:14:37 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2610:1c1:1:606c::50:1d]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mxrelay.nyi.freebsd.org (Postfix) with ESMTPS id 4M304D743wzNSx for ; Wed, 10 Aug 2022 19:14:36 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org ([127.0.1.5]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id 27AJEaCx038524 for ; Wed, 10 Aug 2022 19:14:36 GMT (envelope-from bugzilla-noreply@freebsd.org) Received: (from www@localhost) by kenobi.freebsd.org (8.15.2/8.15.2/Submit) id 27AJEa4Z038523 for ports-bugs@FreeBSD.org; Wed, 10 Aug 2022 19:14:36 GMT (envelope-from bugzilla-noreply@freebsd.org) X-Authentication-Warning: kenobi.freebsd.org: www set sender to bugzilla-noreply@freebsd.org using -f From: bugzilla-noreply@freebsd.org To: ports-bugs@FreeBSD.org Subject: [Bug 265768] [NEW PORT] textproc/py-textract: Extract text from any document Date: Wed, 10 Aug 2022 19:14:37 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Ports & Packages X-Bugzilla-Component: Individual Port(s) X-Bugzilla-Version: Latest X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Only Me X-Bugzilla-Who: DtxdF@riseup.net X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: ports-bugs@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version rep_platform op_sys bug_status bug_severity priority component assigned_to reporter attachments.created Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated List-Id: Ports bug reports List-Archive: https://lists.freebsd.org/archives/freebsd-ports-bugs List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-ports-bugs@freebsd.org X-BeenThere: freebsd-ports-bugs@freebsd.org MIME-Version: 1.0 ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1660158877; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=DUYUbrSvOvMFIerrvGpXZj51wWmUmkc94QM8/ALGOW4=; b=r7T+5ArskXHRxJw1XYw4+7dWZtVcj/ZY8JHWMzi3H8XTeyo5X8tuFYJq7ubjWIxTqPhdxR pUtcZ+a3kVyiE2vMItIwQQXNJ74WfVdOJR/K5SCZ595Wni0H2A1YWlnKKwRCGRgUnTXJlM RhshzpcVPIZaeTDacMTMHRlVsC/ETkwKabo0nrm/GZ9MKXwLuAldYobfpUkgiKX5rgdF+z A1WcSnTRyAIAtFxd+VE9YaAiwU9bwMFqU8iHHxrdjjg0iFYzhPjGv/PRdZtDC7XOQVlNdg xMFSbcadCbLf+ITe9RZvhB3AT9yUyd9l3qBDm41tbQpk+8JhKmMMb+0Mhpfo/g== ARC-Seal: i=1; s=dkim; d=freebsd.org; t=1660158877; a=rsa-sha256; cv=none; b=GirM+6AMnAtHcps5Xps+3a1V9ybXZPXwNT/fLl7odBj+kzUgW8wOuLEqZQe7F/RAdPbZZY ah93dv0dlqZZMMbob1CXUxUuY3A8vepiJAYEe6ZP0POtzdjkhGY6Z1C6e++ADP/5KXDT79 u7D6GWR6GkL1OSqlXGqW6VBu5AmOvwj85ZCjXOAsi4uk3gEkb3RWeqJX0t5TRrpG1gzW2O NvCd9v+zgUVf0YIDZsiHGBdD+uypoi2i1hchfUgpQ4tkODCgrQEGddT4rv+up+8F6Dt7G7 rZ3J5m1iSgogcwvqY25EcYDMR2zOM9UjuXgIU04vC4K9FF5GCylKc7LaFnRnzA== ARC-Authentication-Results: i=1; mx1.freebsd.org; none X-ThisMailContainsUnwantedMimeParts: N https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D265768 Bug ID: 265768 Summary: [NEW PORT] textproc/py-textract: Extract text from any document Product: Ports & Packages Version: Latest Hardware: Any OS: Any Status: New Severity: Affects Only Me Priority: --- Component: Individual Port(s) Assignee: ports-bugs@FreeBSD.org Reporter: DtxdF@riseup.net Created attachment 235833 --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=3D235833&action= =3Dedit textproc-py-textract-1.6.5.patch textract provides a single interface for extracting content embedded from Word documents, PowerPoint presentations, PDFs and much more, which can be used for further textual analysis and visualization. WWW: https://github.com/deanmalmgren/textract portlint: looks fine. poudriere: testport is ok: with all options enabled, without any option enabled, and with default options enabled (including groups). Requirements: * audio/py-pocketsphinx [1] * textproc/python-pptx [2] * textproc/py-extract-msg [3] [1] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D265766 [2] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D265763 [3] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D265765 --=20 You are receiving this mail because: You are the assignee for the bug.=