From: Olivier Nicole
Date: Tue, 9 Jun 2009 10:21:49 +0700 (ICT)
To: djuatdelta@gmail.com
Cc: freebsd-questions@freebsd.org
Subject: Re: PDF inventory software
In-reply-to: (message from Daniel Underwood on Mon, 8 Jun 2009 23:11:50 -0400)
Message-Id: <200906090321.n593LnOa088398@banyan.cs.ait.ac.th>

> Since all the PDFs contain text (none are scanned "images"), can I
> simply use some command like grep to search for text within the
> collection? If so, how would I do this? Can grep read text from
> within PDFs?

Not directly, but pdftotext converts a PDF to plain text that grep can
then search. It comes with the xpdf port, I think.

Olivier
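
A rough sketch of how pdftotext and grep might be combined to search a
whole collection. It assumes pdftotext is installed and on the PATH, and
that every file ends in a lowercase ".pdf". The function name pdfsearch
is made up for illustration; it is not part of xpdf.

```shell
#!/bin/sh
# Sketch only: list the PDFs under a directory whose text matches a pattern.
# pdfsearch is a hypothetical helper name, not a tool shipped with xpdf.
# Usage: pdfsearch PATTERN [DIR]    (DIR defaults to the current directory)
pdfsearch() {
    pattern=$1
    dir=${2:-.}
    # Note: file names containing newlines are not handled by this loop.
    find "$dir" -type f -name '*.pdf' | while IFS= read -r pdf; do
        # Passing "-" as the output file makes pdftotext write to stdout.
        # grep -q prints nothing and only reports whether there was a match;
        # -i makes the match case-insensitive.
        if pdftotext "$pdf" - 2>/dev/null </dev/null | grep -q -i -e "$pattern"; then
            printf '%s\n' "$pdf"
        fi
    done
}
```

Calling it as `pdfsearch inventory ~/pdfs` (the directory is only an
example) should print the path of each PDF containing "inventory". This
converts every file on each search, so for a large collection it may be
worth converting once to .txt files and running grep on those instead.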