[Letux-kernel] [Gta04-owner] New LetuxOS Kernels

Andreas Kemnade andreas at kemnade.info
Tue Jun 19 23:32:08 CEST 2018


Hi,

On Tue, 19 Jun 2018 20:07:23 +0200
"H. Nikolaus Schaller" <hns at goldelico.com> wrote:

> [technical discussions should go to the proper mailing lists]
> 
> Hi Andreas,
> 
> > Am 19.06.2018 um 19:46 schrieb H. Nikolaus Schaller <hns at goldelico.com>:
> > 
> > Hi,
> >   
> >> Am 19.06.2018 um 19:22 schrieb Andreas Kemnade <andreas at kemnade.info>:
> >> 
> >> On Tue, 19 Jun 2018 11:38:26 +0200
> >> "H. Nikolaus Schaller" <hns at goldelico.com> wrote:
> >>   
> >>> Hi,
> >>> we did hunt a while for a big bug in the kernel.
> >>> It was a little flaky and random but the main
> >>> symptom was a kernel NULL pointer panic in
> >>> strcmp().
> >>> 
> >>> The result was that the GTA04 (and also the Pyra)
> >>> did not continue to boot properly. Sometimes it
> >>> came to a login: but then several drivers were
> >>> missing.
> >>> 
> >>> After hunting down the bug it was not a device
> >>> driver but a race and dangling pointer problem
> >>> in the pincontrol subsystem.
> >>> 
> >>> Now we got some patches from the maintainer and
> >>> the problem seems to have disappeared.
> >>> 
> >>> Well, has it or hasn't is?  
> >> 
> >> compiled dca26f608a765008b869991bf29fa241769599fb + my compile fix
> >> 
> >> result: strcmp problem again.
> >> [    7.335571] [<c074abc0>] (strcmp) from [<c04266bc>] (pinctrl_generic_add_group+0x50/0xc4)
> >> [    7.344116] [<c04266bc>] (pinctrl_generic_add_group) from [<c042b920>] (pcs_dt_node_to_map+0x484/0x820)
> >> [    7.353912] [<c042b920>] (pcs_dt_node_to_map) from [<c04298f4>] (pinctrl_dt_to_map+0x220/0x2bc)
> >> [    7.363006] [<c04298f4>] (pinctrl_dt_to_map) from [<c04269c4>] (create_pinctrl+0x5c/0x318)  
> > 
> > Hm...
> > 
> > I did a quick boot - and on first boot I also got a strcmp(NULL).
> > From the SD card which I had used for extensive testing yesterday.
> > 
> > What the hell is going on here?  
> 
> Maybe it is still a bug to devm_kzalloc something and store in the radix
> tree and leave it there, even if the driver is detached?
> 
> Then we still try to access this memory region by scanning the tree.
> 
> For test purposes we could replace the devm_kzalloc by kzalloc. This
> whould leak a little memory, but my hope is that the problem disappears.
> 
> Do you have a repeatable (at least >some%) scenario to reproduce the
> bug?

unfortunately not, maybe we should pass init=/modprobe-mess.sh to
kernel commandline, and create a worst case modprobe scenario there.
So we can control probing order more.

Regards,
Andreas
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <http://lists.goldelico.com/pipermail/letux-kernel/attachments/20180619/3e2cdae1/attachment.asc>


More information about the Letux-kernel mailing list