From owner-freebsd-arm@FreeBSD.ORG Sat May 10 17:39:31 2014 Return-Path: Delivered-To: freebsd-arm@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id A51A3E4B; Sat, 10 May 2014 17:39:31 +0000 (UTC) Received: from gromit.grondar.org (grandfather.grondar.org [IPv6:2a01:348:0:15:5d59:5c20:0:2]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 479D48DD; Sat, 10 May 2014 17:39:31 +0000 (UTC) Received: from [2001:470:9174:1:5117:6932:e21:6b24] by gromit.grondar.org with esmtpsa (TLSv1:AES128-SHA:128) (Exim 4.80.1 (FreeBSD)) (envelope-from ) id 1WjBF2-000MxX-4G; Sat, 10 May 2014 18:39:28 +0100 From: Mark R V Murray Content-Type: multipart/signed; boundary="Apple-Mail=_49E0B075-E125-464D-B3DE-52759A4DB635"; protocol="application/pgp-signature"; micalg=pgp-sha512 Date: Sat, 10 May 2014 18:39:55 +0100 Subject: Fast cycle counter for ARM chips with SCC - patch for review. To: freebsd-arm Message-Id: <22E12094-E6B2-42F9-94AB-014A702D17F2@FreeBSD.org> Mime-Version: 1.0 (Mac OS X Mail 7.2 \(1874\)) X-Mailer: Apple Mail (2.1874) X-SA-Score: -1.0 X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: "Porting FreeBSD to ARM processors." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 10 May 2014 17:39:31 -0000 --Apple-Mail=_49E0B075-E125-464D-B3DE-52759A4DB635 Content-Type: multipart/mixed; boundary="Apple-Mail=_2C57E05C-866C-464D-934F-B4E6684D0F0C" --Apple-Mail=_2C57E05C-866C-464D-934F-B4E6684D0F0C Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=windows-1252 Hi * This patch makes the ARM6 kernels that have an SCC coprocessor (RPI and = WANDBOARD have them) get a MUCH better implementation of = get_cyclecount(9), but not a perfect one. The incrementing rate is good = (+- 1 per instruction), but its only 32 bits. Later, if there is = interest, I may wish to fix that with an overflow interrupt, but for now = its easily good enough for the kernel entropy harvesting service. Also, = its MUUUCH more efficient; a simple read rather can calling the internal = kernel binuptime(9) clock. Comments, please? I=92m keen to commit. M --=20 Mark R V Murray --Apple-Mail=_2C57E05C-866C-464D-934F-B4E6684D0F0C Content-Disposition: attachment; filename=arm_ccnt_counter.diff Content-Type: application/octet-stream; name="arm_ccnt_counter.diff" Content-Transfer-Encoding: 7bit Index: sys/arm/arm/cpufunc.c =================================================================== --- sys/arm/arm/cpufunc.c (revision 265841) +++ sys/arm/arm/cpufunc.c (working copy) @@ -1387,6 +1387,29 @@ } #endif /* CPU_ARM9E || CPU_ARM10 */ +#if defined(CPU_ARM1136) || defined(CPU_ARM1176) \ + || defined(CPU_MV_PJ4B) \ + || defined(CPU_CORTEXA) || defined(CPU_KRAIT) +static __inline void +cpu_scc_setup_ccnt(void) +{ + /* Set up the PMCCNTR register as a cyclecounter: + * Set PMUSERENR[0] to 1 to allow user read + * Set PMINTENCLR to 0x8000000F to block interrupts + * Set PMCR[0] to 1 to enable CCNT + * Set PMCNTENSET to 0x8000000F to enable counters */ + __asm volatile ("mcr p15, 0, %0, c9, c14, 0\n\t" + "mcr p15, 0, %1, c9, c14, 2\n\t" + "mcr p15, 0, %2, c9, c12, 0\n\t" + "mcr p15, 0, %3, c9, c12, 1\n\t" + : + : "r"(0x00000001), + "r"(0x8000000F), + "r"(0x00000007), + "r"(0x8000000F)); +} +#endif + #if defined(CPU_ARM1136) || defined(CPU_ARM1176) struct cpu_option arm11_options[] = { { "cpu.cache", BIC, OR, (CPU_CONTROL_IC_ENABLE | CPU_CONTROL_DC_ENABLE) }, @@ -1490,6 +1513,8 @@ /* And again. */ cpu_idcache_wbinv_all(); + + cpu_scc_setup_ccnt(); } #endif /* CPU_ARM1136 || CPU_ARM1176 */ @@ -1524,6 +1549,8 @@ /* And again. */ cpu_idcache_wbinv_all(); + + cpu_scc_setup_ccnt(); } #endif /* CPU_MV_PJ4B */ @@ -1571,6 +1598,8 @@ #ifdef SMP armv7_auxctrl((1 << 6) | (1 << 0), (1 << 6) | (1 << 0)); /* Enable SMP + TLB broadcasting */ #endif + + cpu_scc_setup_ccnt(); } #endif /* CPU_CORTEXA */ Index: sys/arm/include/cpu.h =================================================================== --- sys/arm/include/cpu.h (revision 265841) +++ sys/arm/include/cpu.h (working copy) @@ -14,11 +14,26 @@ static __inline uint64_t get_cyclecount(void) { +/* This '#if' asks the question 'Do we have a System Control Coprocessor?' */ +#if defined(CPU_ARM1136) || defined(CPU_ARM1176) \ + || defined(CPU_MV_PJ4B) \ + || defined(CPU_CORTEXA) || defined(CPU_KRAIT) + uint32_t ccnt; + uint64_t ccnt64; + + /* + * Read PMCCNTR. Curses! Its only 32 bits. + * TODO: Fix this by catching overflow with interrupt? + */ + __asm __volatile("mrc p15, 0, %0, c9, c13, 0": "=r" (ccnt)); + ccnt64 = (uint64_t)ccnt; + return (ccnt64); +#else /* No SCC, use binuptime(9). This is slooooow */ struct bintime bt; binuptime(&bt); return ((uint64_t)bt.sec << 56 | bt.frac >> 8); - +#endif } #endif --Apple-Mail=_2C57E05C-866C-464D-934F-B4E6684D0F0C Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=us-ascii --Apple-Mail=_2C57E05C-866C-464D-934F-B4E6684D0F0C-- --Apple-Mail=_49E0B075-E125-464D-B3DE-52759A4DB635 Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=signature.asc Content-Type: application/pgp-signature; name=signature.asc Content-Description: Message signed with OpenPGP using GPGMail -----BEGIN PGP SIGNATURE----- Version: GnuPG/MacGPG2 v2.0.22 (Darwin) Comment: GPGTools - http://gpgtools.org iQCVAwUBU25kb958vKOKE6LNAQrk+wP+PmL6xGV284KMuynxLtUOt27mVoJaefBP Bc3KinklrFEmy25XDszy8JXYB9G/0mzJTSFy5KUL+2w5+4i9JJcnGYqvQSSoAR0o J+0CBtaLq1iGWDa+uDrLx66D2WwHIFnozgOxGfSvpUnlLmm3gMe4iPOxkFB5jwnI gwcq2Zp6+VM= =fJWB -----END PGP SIGNATURE----- --Apple-Mail=_49E0B075-E125-464D-B3DE-52759A4DB635--