Date: Thu, 19 Mar 2020 11:00:42 +0200 From: Toomas Soome <tsoome@me.com> To: Mark Millard <marklmi@yahoo.com> Cc: freebsd-arm <freebsd-arm@freebsd.org>, FreeBSD Current <freebsd-current@freebsd.org> Subject: Re: head -r538966 on OrangePi+ 2ed: boot loader crashes when USB drive is present at power-on/boot: its a misaligned access by code from -r354746 Message-ID: <26DEEA3C-C880-410C-BCB4-CE1314EAFA02@me.com> In-Reply-To: <30A666F0-8773-41B7-AD37-7E319AA510CA@yahoo.com> References: <863312E1-4216-49BA-A623-CAC85F123655@yahoo.com> <30A666F0-8773-41B7-AD37-7E319AA510CA@yahoo.com>
next in thread | previous in thread | raw e-mail | index | archive | help
I think this should fix it. We need to create copy of dos partition = array, so we will get proper alignment.=20 tsoome@freebsd-2:/usr/src % svn diff stand/common/part.c Index: stand/common/part.c =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --- stand/common/part.c (revision 359099) +++ stand/common/part.c (working copy) @@ -654,6 +654,7 @@ int has_ext; #endif table =3D NULL; + dp =3D NULL; buf =3D malloc(sectorsize); if (buf =3D=3D NULL) return (NULL); @@ -708,7 +709,11 @@ goto out; } /* Check that we have PMBR. Also do some validation. */ - dp =3D (struct dos_partition *)(buf + DOSPARTOFF); + dp =3D malloc(NDOSPART * sizeof(struct dos_partition)); + if (dp =3D=3D NULL) + goto out; + bcopy(buf + DOSPARTOFF, dp, NDOSPART * sizeof(struct = dos_partition)); + /* * In mac we can have PMBR partition in hybrid MBR; * that is, MBR partition which has DOSPTYP_PMBR entry defined = as @@ -770,6 +775,7 @@ #endif /* LOADER_MBR_SUPPORT */ #endif /* LOADER_MBR_SUPPORT || LOADER_GPT_SUPPORT */ out: + free(dp); free(buf); return (table); } tsoome@freebsd-2:/usr/src % > On 19. Mar 2020, at 10:20, Mark Millard <marklmi@yahoo.com> wrote: >=20 > [I built with -DDEBUG -DDISK_DEBUG -DPART_DEBUG and show > with some extra output as well. It shows that a misaligned > access causes the crash. The access in question is from > head -r354746 code additions.] >=20 > On 2020-Mar-18, at 13:36, Mark Millard <marklmi at yahoo.com> wrote: >=20 >> Without a USB drive present at power-on or >> boot, the OPi+2e boots fine. (The USB drives >> involved have a partition holding a ufs file >> system and a partition holding a swap/aging >> space.) >>=20 >> In all cases below, /boot/ is from the microsd >> card. But the intended configuration is for >> vfs.root.mountfrom to be used to direct the >> stages after kernel-booting to the USB drive. >>=20 >> The output sequence related to the crash when >> the USB drive is present looks like: >>=20 >> QUOTE >> . . . >> END QUOTE >=20 > I'm replacing the original quote with > better information. >=20 > First I quote the definition of dos_partition > for reference: >=20 > struct dos_partition { > unsigned char dp_flag; /* bootstrap flags */ > unsigned char dp_shd; /* starting head */ > unsigned char dp_ssect; /* starting sector */ > unsigned char dp_scyl; /* starting cylinder */ > unsigned char dp_typ; /* partition type */ > unsigned char dp_ehd; /* end head */ > unsigned char dp_esect; /* end sector */ > unsigned char dp_ecyl; /* end cylinder */ > uint32_t dp_start; /* absolute starting sector = number */ > uint32_t dp_size; /* partition size in sectors */ > }; >=20 > Note that access to dp_start or dp_size > requires address%4=3D=3D0 alignment but the > other fields do not. This is important for > &dp[i].dp_start reported in the below (and > then dp[i].dp_start is accessed in the code). > I also had it report dp, which shows that > dp has dp%4!=3D0 as well. >=20 >=20 >=20 > Here is with -DDEBUG and -DDISK_DEBUG -DPART_DEBUG > in place for the code (with some extra debug output > added): >=20 > FreeBSD/armv7 U-Boot loader, Revision 1.3 >=20 > signature: > version =3D 1 > checksum =3D 0x98de198b > sc entry =3D 0xbdf8cb01 >=20 > addresses info: > _etext (sdata) =3D 0x4204fdb4 > _edata =3D 0x4205ebb8 > __sbss_start =3D 0x4205ec28 > __sbss_end =3D 0x4205ec28 > __sbss_start =3D 0x4205ec28 > _end =3D 0x42062aa0 > syscall entry =3D 0xbdf8cb01 > DRAM: 2048MB > Number of U-Boot devices: 2 > U-Boot env: loaderdev not set, will probe all devices. > stor_init(): storage devices found: 2 > Found U-Boot device: disk > Probing all <unknown> devices... > Checking unit=3D0 slice=3D<auto> partition=3D<auto>...disk_open: = disk0s0: unit 0, slice 0, partition -2 =3D> 0x420631c0 > stor_readdev(): reading blk=3D0 size=3D1 @ 0x420633c0 > stor_readdev(): reading blk=3D64 size=3D1 @ 0x420636c0 > stor_readdev(): reading blk=3D1 size=3D1 @ 0x420636c0 > ptable_open: BEFORE NDOSPART loop #1 > ptable_open: dp=3D0x4206357e > ptable_open: IN NDOSPART loop #1 after 1st if > ptable_open: IN NDOSPART loop #1: &dp[i].dp_start=3D0x42063586 > data abort > pc : [<42009350>] lr : [<42009348>] > reloc pc : [<ce07d350>] lr : [<ce07d348>] > sp : b9f649e8 ip : 4205e200 fp : b9f64a18 > r10: 42063640 r9 : b9f64a50 r8 : 420633c0 > r7 : 420636c0 r6 : 4200818c r5 : 4205cd95 r4 : 42063586 > r3 : bdf8cdf7 r2 : 0000000a r1 : 01c28000 r0 : 000000ee > Flags: nZCv IRQs off FIQs off Mode SVC_32 > Code: e08f0000 eb0114af e5d801c2 e35000ee (05940000)=20 > Resetting CPU ... >=20 > resetting ... >=20 >=20 > My extra messages are from: >=20 > # svnlite diff /usr/src/stand/common/part.c > Index: /usr/src/stand/common/part.c > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > --- /usr/src/stand/common/part.c (revision 358966) > +++ /usr/src/stand/common/part.c (working copy) > @@ -715,18 +715,26 @@ > * start sector 1. After DOSPTYP_PMBR, there may be other = partitions. > * UEFI compliant PMBR has no other partitions. > */ > + DPRINTF("BEFORE NDOSPART loop #1"); > + DPRINTF("dp=3D%p", (void*)dp); > for (i =3D 0; i < NDOSPART; i++) { > if (dp[i].dp_flag !=3D 0 && dp[i].dp_flag !=3D 0x80) { > DPRINTF("invalid partition flag %x", = dp[i].dp_flag); > goto out; > } > + DPRINTF("IN NDOSPART loop #1 after 1st if"); > + DPRINTF("IN NDOSPART loop #1: &dp[i].dp_start=3D%p", = (void*)&dp[i].dp_start); > #ifdef LOADER_GPT_SUPPORT > if (dp[i].dp_typ =3D=3D DOSPTYP_PMBR && dp[i].dp_start = =3D=3D 1) { > + DPRINTF("BEFORE table->type assignment for PMBR = detected"); > + DPRINTF("table=3D%p for PMBR detected", = (void*)table); > table->type =3D PTABLE_GPT; > DPRINTF("PMBR detected"); > } > + DPRINTF("IN NDOSPART loop #1 after 2nd if"); > #endif > } > + DPRINTF("AFTER NDOSPART loop #1"); > #ifdef LOADER_GPT_SUPPORT > if (table->type =3D=3D PTABLE_GPT) { > table =3D ptable_gptread(table, dev, dread); >=20 >=20 > Note that "IN NDOSPART loop #1 after 2nd if" is > never reported. Nor is "BEFORE table->type > assignment for PMBR detected". The condition in > the 2nd if never completes its evaluation because > of the misaligned access involved in dp[i].dp_start > being evaluated. >=20 >=20 >> Stopping it earlier and using: >>=20 >> setenv loaderdev mmc 0 >> boot >>=20 >> avoids the problem because it avoids "probing" the >> USB drive at the stage indicated above. >=20 > Yep: The above loop is then not evaluated. >=20 >> But the >> boot then actually uses: >>=20 >> vfs.root.mountfrom=3D"ufs:/dev/gpt/BPIM3root" >>=20 >> in the /boot/loader.conf on the microsd card. >> ufs:/dev/gpt/BPIM3root is a reference to the ufs >> partition on the USB media. (With the mountfrom >> commented out the microsd card is bootable by >> itself.) So later the USB drive is put to use >> successfully when the initial probing is avoided. >>=20 >> Looking in the /usr/src/stand/uboot/common/main.c >> code shows: >>=20 >> static int >> probe_disks(int devidx, int load_type, int load_unit, int load_slice, >> int load_partition) >> { >> . . . >> if (load_unit =3D=3D -1) { >> printf(" Probing all %s devices...\n", = device_typename(load_type)); >> /* Try each disk of given type in succession until one = works. */ >> for (unit =3D 0; unit < UB_MAX_DEV; unit++) { >> currdev.dd.d_unit =3D = uboot_diskgetunit(load_type, unit); >> if (currdev.dd.d_unit =3D=3D -1) >> break; >> print_disk_probe_info(); >> open_result =3D devsw[devidx]->dv_open(&f, = &currdev); >> if (open_result =3D=3D 0) { >> printf(" good.\n"); >> return (0); >> } >> printf("\n"); >> } >> return (-1); >> } >> . . . >> } >>=20 >> So it appears that the crash is during the code involved >> for the line: >>=20 >> open_result =3D devsw[devidx]->dv_open(&f, &currdev); >>=20 >> Note that the boot attempt reported "Probing all <unknown> >> devices..." so device_typename(load_type) came up with >> "<unknown>". (I've no clue if that is significant to the >> issue or not.) >=20 >=20 > Looks like the below paragraph was junk. Sorry. >=20 >> It appeared that /usr/src/stand/usb/storage/umass_loader.c >> and its umass_disk_open and umass_disk_open_sub might >> be involved and then code from the likes of: >> /usr/src/sys/dev/usb/ --such as from usb_msctest.c for >> usb_msc_read_capacity. (I stopped looking around >> there: well outside areas I know how to interpret.) >=20 >=20 >> For reference, I used the OPi+2e u-boot material from >> my poudriere-devel based port builds: >>=20 >> # ls -ldT /usr/local/share/u-boot/u-boot-orangepi-plus-2e/* >> -rw-r--r-- 1 root wheel 503 Oct 26 19:12:16 2019 = /usr/local/share/u-boot/u-boot-orangepi-plus-2e/README >> -rw-r--r-- 1 root wheel 199 Oct 26 19:12:16 2019 = /usr/local/share/u-boot/u-boot-orangepi-plus-2e/boot.scr >> -rw-r--r-- 1 root wheel 66 Oct 26 19:12:16 2019 = /usr/local/share/u-boot/u-boot-orangepi-plus-2e/metadata >> -rw-r--r-- 1 root wheel 471250 Oct 26 19:12:16 2019 = /usr/local/share/u-boot/u-boot-orangepi-plus-2e/u-boot-sunxi-with-spl.bin >>=20 >>=20 >>=20 >> (The USB drives are USB SSDs.) >=20 >=20 >=20 >=20 > =3D=3D=3D > Mark Millard > marklmi at yahoo.com > ( dsl-only.net went > away in early 2018-Mar) >=20
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?26DEEA3C-C880-410C-BCB4-CE1314EAFA02>