Date: Sat, 23 Jan 2021 09:04:21 +0000
From: Steve O'Hara-Smith
To: freebsd-questions@freebsd.org
Cc: Polytropon
Subject: Re: Convert PDF to Excel
Message-Id: <20210123090421.7fb3ede1754fe280b685f83c@sohara.org>
In-Reply-To: <20210123094041.f932fd4c.freebsd@edvax.de>

On Sat, 23 Jan 2021 09:40:41 +0100
Polytropon wrote:

> They contain text, so the OCR problem is out of the way.
> Sadly, the text is re-arranged so the optimal solution (one
> line in a table equals one line of text, with the columns
> being separated by whitespace) does not appear, instead it
> is the other way round: one line equals one column.

	I spy a fun interview question buried in this problem - flipping a
text file like that efficiently is far from easy - dead easy if you don't
mind eating memory of course.

-- 
Steve O'Hara-Smith
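
For illustration only, here is a rough sketch of the "dead easy if you don't mind eating memory" variant in Python. It assumes each input line holds the whitespace-separated values of one table column, as described in the quoted text, and that no value contains whitespace itself. The function name transpose_lines and the sample data are made up for this example; the whole input is held in memory, so this is not the efficient version.

```python
from itertools import zip_longest


def transpose_lines(lines, fill=""):
    """Turn 'one line equals one column' text into table rows.

    Assumption: values are separated by whitespace and never contain
    whitespace themselves. Everything is kept in memory at once.
    """
    # Each input line is one column of the table.
    columns = [line.split() for line in lines]
    # zip_longest pairs up the n-th value of every column into one row,
    # padding short columns with the fill value.
    return [list(row) for row in zip_longest(*columns, fillvalue=fill)]


if __name__ == "__main__":
    # Hypothetical sample: three columns, the last one a value short.
    sample = ["name alice bob", "age 30 25", "city paris"]
    for row in transpose_lines(sample):
        print("\t".join(row))
```

Memory use grows with the size of the whole file, which is exactly the trade-off mentioned above; doing the same without holding everything in memory would need repeated passes or seeking within the input.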