RedPill - the new loader for 6.2.4 - Discussion


Recommended Posts

9 hours ago, erkify said:

Thanks for the work, I have used it on my baremetal G8 microserver and I have been able to update everything, but the system reboots after about 20 minutes and reboots again, any ideas? Thank you

 

If you are using docker, after couple of minutes it will stop responding. I have verified that even with a single container set to Low and 512MB, after some time it will stop responding. It is strange that it doesnt crash, it just hangs. Not able to see whats happening in the console as i dont have a serial port on my Gen7 that im using for tests.

Edited by pocopico
Link to post
Share on other sites
1 hour ago, Orphée said:


  "synoinfo": {
    "supportsystemperature": "no",
    "supportsystempwarning": "no"

  }

When I do this (or the command line equivalent previously mentioned), it fixes the empty General tab problem, but also seems to stop the system from polling the 'System' temperature, as I'm now getting alerts from my SNMP monitoring platform (LibreNMS) that the 'System' temperature is below the configured minimum:

2010714156_Screenshot2021-09-17at09_52_23.thumb.png.b3ede575ae50da699c507aff5cd39a11.png

 

It seems the DSM is polling the temperature properly from the hardware sensors (as it was exposing them via SNMP, and the measurements match what I can see on the iLO), but it's apparently not parsing the value correctly, leading to the blank tab in General.

 

On balance I'd rather have the correct data in my monitoring platform than the General information page in Control Panel, so I'll leave put it back as it was now, with 'supportsystemperature' and 'supportsystempwarning' set to 'yes'.

Link to post
Share on other sites
11 minutes ago, mcdull said:

I guess we should open another thread for helping others to use the loader and to leave this thread for development needs. 

now this thread is flooding with operational issue.

 

You can always open issues on Github for any issues that are addressed to the developers. This thread IMHO, helps people understand the concept, install using the loader on their test systems and that's pushing the development even further. Testing process anyway is a  development stage.

Edited by pocopico
  • Like 1
  • Thanks 1
Link to post
Share on other sites

I confirm the system crash once a docker container is running.

I installed default nginx container, ran it.... and system froze a few minutes later.

had to reset system (ESXi Virtual Machine)

 

Don't know if this is revelant :

image.thumb.png.3528cae70a9b91938fb4547504102a9a.png

Link to post
Share on other sites

Hey all

 

I've can build fine thanks to the info on here but can someone advise how I build a 918 build it is always the 3615xs that gets created.

 

I know I would love a DVA version build to get AI CCTV running with the 6 licences and Nvidia, Is this on the roadmap/possible?

 

Thanks for al the hard work

Link to post
Share on other sites
30 minutes ago, Orphée said:

I confirm the system crash once a docker container is running.

I installed default nginx container, ran it.... and system froze a few minutes later.

had to reset system (ESXi Virtual Machine)

 

Don't know if this is revelant :

image.thumb.png.3528cae70a9b91938fb4547504102a9a.png

Interesting that you're seeing this in ESXi, others have only reported it on baremetal so far. Are those messages from ESXi, or the xpenolgy guest?

 

What are you seeing on the serial output from the xpenology guest?

 

FWIW I was seeing output like this each time my system kernel panic'd and rebooted:

[  194.017549] Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 2
[  194.052575] CPU: 2 PID: 19580 Comm: containerd-shim Tainted: PF          O 3.10.105 #25556
[  194.094270] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 04/04/2019
[  194.128400]  ffffffff814c904d ffffffff814c8121 0000000000000010 ffff880109a88d58
[  194.164811]  ffff880109a88cf0 0000000000000000 0000000000000002 000000000000002b
[  194.201332]  0000000000000002 ffffffff80000001 0000000000000010 ffff880103ee5c00
[  194.238138] Call Trace:
[  194.250471]  <NMI>  [<ffffffff814c904d>] ? dump_stack+0xc/0x15
[  194.279225]  [<ffffffff814c8121>] ? panic+0xbb/0x1ce
[  194.304100]  [<ffffffff810a0922>] ? watchdog_overflow_callback+0xb2/0xc0
[  194.337400]  [<ffffffff810b152b>] ? __perf_event_overflow+0x8b/0x240
[  194.368795]  [<ffffffff810b02d4>] ? perf_event_update_userpage+0x14/0xf0
[  194.401338]  [<ffffffff81015411>] ? intel_pmu_handle_irq+0x1d1/0x360
[  194.432957]  [<ffffffff81010026>] ? perf_event_nmi_handler+0x26/0x40
[  194.464708]  [<ffffffff81005fa8>] ? do_nmi+0xf8/0x3e0
[  194.488902]  [<ffffffff814cfa53>] ? end_repeat_nmi+0x1e/0x7e
[  194.517219]  <<EOE>> 
[  195.556746] Shutting down cpus with NMI
[  195.576047] Rebooting in 3 seconds..

 

Most of the time is was containerd-shim that was named in the messaging, but sometimes it was the specific process within the container, influxdb. It was always my influxdb container causing the crashes, and my system has been very stable since I stopped trying to run it. I have done a lot of searching on the internet and can find no mention of influxdb in docker causing kernel panics, therefore I'm almost certain that this is unique to redpill. I used influxdb for years on xpenology using Jun's bootloader and have only seen this moving to redpill, on 6.2.4 and 7.0.1-RC1.

 

Can we move this to the other topic I started, though? We're cluttering up the thread here...

 

https://xpenology.com/forum/topic/47970-docker-causing-kernel-panics-after-move-to-624-on-redpill/

 

Edited by WiteWulf
Link to post
Share on other sites
22 minutes ago, spv4u1975 said:

I've can build fine thanks to the info on here but can someone advise how I build a 918 build it is always the 3615xs that gets created.

 

I know I would love a DVA version build to get AI CCTV running with the 6 licences and Nvidia, Is this on the roadmap/possible?

You need to build the apollolake target to get 918, bromolow builds 3615xs.

 

Also, make sure your hardware (CPU and chipset) is compatible with apollolake.

  • Like 1
Link to post
Share on other sites
9 minutes ago, WiteWulf said:

Interesting that you're seeing this in ESXi, others have only reported it on baremetal so far. Are those messages from ESXi, or the xpenolgy guest?

 

What are you seeing on the serial output from the xpenology guest?

 

FWIW I was seeing output like this each time my system kernel panic'd and rebooted:




[  194.017549] Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 2
[  194.052575] CPU: 2 PID: 19580 Comm: containerd-shim Tainted: PF          O 3.10.105 #25556
[  194.094270] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 04/04/2019
[  194.128400]  ffffffff814c904d ffffffff814c8121 0000000000000010 ffff880109a88d58
[  194.164811]  ffff880109a88cf0 0000000000000000 0000000000000002 000000000000002b
[  194.201332]  0000000000000002 ffffffff80000001 0000000000000010 ffff880103ee5c00
[  194.238138] Call Trace:
[  194.250471]  <NMI>  [<ffffffff814c904d>] ? dump_stack+0xc/0x15
[  194.279225]  [<ffffffff814c8121>] ? panic+0xbb/0x1ce
[  194.304100]  [<ffffffff810a0922>] ? watchdog_overflow_callback+0xb2/0xc0
[  194.337400]  [<ffffffff810b152b>] ? __perf_event_overflow+0x8b/0x240
[  194.368795]  [<ffffffff810b02d4>] ? perf_event_update_userpage+0x14/0xf0
[  194.401338]  [<ffffffff81015411>] ? intel_pmu_handle_irq+0x1d1/0x360
[  194.432957]  [<ffffffff81010026>] ? perf_event_nmi_handler+0x26/0x40
[  194.464708]  [<ffffffff81005fa8>] ? do_nmi+0xf8/0x3e0
[  194.488902]  [<ffffffff814cfa53>] ? end_repeat_nmi+0x1e/0x7e
[  194.517219]  <<EOE>> 
[  195.556746] Shutting down cpus with NMI
[  195.576047] Rebooting in 3 seconds..

 

Most of the time is was containerd-shim that was named in the messaging, but sometimes it was the specific process within the container, influxdb. It was always my influxdb container causing the crashes, and my system has been very stable since I stopped trying to run it. I have done a lot of searching on the internet and can find no mention of influxdb in docker causing kernel panics, therefore I'm almost certain that this is unique to redpill. I used influxdb for years on xpenology using Jun's bootloader and have only seen this moving to redpill, on 6.2.4 and 7.0.1-RC1.

 

Can we more this to the other topic I started, though? We're cluttering up the thread here...

 

https://xpenology.com/forum/topic/47970-docker-causing-kernel-panics-after-move-to-624-on-redpill/

 

 

Picture from xpenology guest.

 

The hint about IPV6 might be interesting :

 

Quote

[   19.778392] Synotify use 16384 event queue size
[   20.030329] ip6_tables: (C) 2000-2006 Netfilter Core Team
[   20.068195] aufs 3.10.x-20141110
[   20.076943] Bridge firewalling registered
[   20.086435] cgroup: systemd (1) created nested cgroup for controller "blkio" which has incomplete hierarchy support. Nested cgroups may change behavior in the future.
[   20.646395] <redpill/pmu_shim.c:310> Got 2 bytes from PMU: reason=1 hex={2d 33} ascii="-3"
[   20.647879] <redpill/pmu_shim.c:239> Executing cmd OUT_BUZ_LONG handler cmd_shim_noop+0x0/0x30 [redpill]
[   20.649544] <redpill/pmu_shim.c:45> vPMU received OUT_BUZ_LONG using 1 bytes - NOOP
[   21.143474] fuse init (API version 7.22)
[   21.466668] Initializing XFRM netlink socket
[   21.469339] Netfilter messages via NETLINK v0.30.
[   21.539973] IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready
[   22.646774] <redpill/pmu_shim.c:310> Got 2 bytes from PMU: reason=1 hex={2d 38} ascii="-8"
[   22.648188] <redpill/pmu_shim.c:239> Executing cmd OUT_STATUS_LED_ON_GREEN handler cmd_shim_noop+0x0/0x30 [redpill]
[   22.649863] <redpill/pmu_shim.c:45> vPMU received OUT_STATUS_LED_ON_GREEN using 1 bytes - NOOP
[   33.913043] Synotify use 16384 event queue size
[   34.650817] Synotify use 16384 event queue size
[   63.071479] device docker8d303b8 entered promiscuous mode
[   63.072593] IPv6: ADDRCONF(NETDEV_UP): docker8d303b8: link is not ready
[   63.602699] IPv6: ADDRCONF(NETDEV_CHANGE): docker8d303b8: link becomes ready
[   63.604006] docker0: port 1(docker8d303b8) entered forwarding state
[   63.605095] docker0: port 1(docker8d303b8) entered forwarding state
[   63.606410] IPv6: ADDRCONF(NETDEV_CHANGE): docker0: link becomes ready
[   72.294676] <redpill/smart_shim.c:794> Handling ioctl(0x31f) for /dev/sda
[   72.296079] <redpill/smart_shim.c:624> Got SMART *command* - looking for feature=0xd0
[   72.297396] <redpill/smart_shim.c:376> Generating fake SMART values
[   78.645421] docker0: port 1(docker8d303b8) entered forwarding state

and system froze.

 

Disabling DSM IPV6 seems to fix the issue... I'ill leave container running to confirm

Edited by Orphée
Link to post
Share on other sites

IPv6 is disabled in control panel on my system:

image.thumb.png.47b6e410c5e3cfa6431c33c3e667d018.png

 

...but docker still tries to setup IPv6 networking for some or all containers when they start.

 

IPv6 is also disabled on the docker networks on my setup:689981505_Screenshot2021-09-17at10_58_56.thumb.png.cc324f5f0272c735609285fe680756ff.png

 

Edited by WiteWulf
Link to post
Share on other sites
5 minutes ago, WiteWulf said:

IPv6 is disabled in control panel on my system:

image.thumb.png.47b6e410c5e3cfa6431c33c3e667d018.png

 

...but docker still tries to setup IPv6 networking for some or all containers when they start.

 

IPv6 is also disabled on the docker networks on my setup:689981505_Screenshot2021-09-17at10_58_56.thumb.png.cc324f5f0272c735609285fe680756ff.png

 

Same for me, but at least IPv6 disabled in DSM fix the crashes.

  • Like 1
Link to post
Share on other sites

is this Docker Crashes also in 6.2.4 or only in DSM7 ?

 

maybe docker has some problems with DSM7 :)

 

i will do a test today evening with my fresh Gen8 on DSM7 and a Docker Image.

Edited by altas
Link to post
Share on other sites

lol it is bit strange doing some stupid things reload the login page starting docker jd2 file download smb copy with 300 MB but the server will not crash since i have enabedl the ilo console in putty

creating docker etc

using the hammer

but the server will not crash 

 

 

Link to post
Share on other sites

I have disabled IPV 6 in the networkadapter also on Docker but the server still reboot saw this message at least:

[  817.217981] CPU: 0 PID: 2187 Comm: runc Tainted: PF        C O 3.10.108 #42214
[  817.253128] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 11/02/2015
[  817.287675]  ffffffff814a2759 ffffffff814a16b1 0000000000000010 ffff880309a08d60
[  817.324215]  ffff880309a08cf8 0000000000000000 0000000000000000 0000000000000001
[  817.360571]  0000000000000000 ffffffff80000001 0000000000000030 ffff8802f9a1d000
[  817.396778] Call Trace:
[  817.408374]  <NMI>  [<ffffffff814a2759>] ? dump_stack+0xc/0x15
[  817.436964]  [<ffffffff814a16b1>] ? panic+0xbb/0x1df
[  817.461381]  [<ffffffff810a9eb8>] ? watchdog_overflow_callback+0xa8/0xb0
[  817.494556]  [<ffffffff810db7d3>] ? __perf_event_overflow+0x93/0x230
[  817.525829]  [<ffffffff810da612>] ? perf_event_update_userpage+0x12/0xf0
[  817.558713]  [<ffffffff810152a4>] ? intel_pmu_handle_irq+0x1b4/0x340
[  817.590098]  [<ffffffff814a9d06>] ? perf_event_nmi_handler+0x26/0x40
[  817.621060]  [<ffffffff814a944e>] ? do_nmi+0xfe/0x440
[  817.645848]  [<ffffffff814a8a53>] ? end_repeat_nmi+0x1e/0x7e
[  817.673689]  <<EOE>>
[  817.684069] Rebooting in 3 seconds..

 

after the reboot:


[  141.234109] IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready
but in DSM IPV 6 is disabled 



During the Boot process after the server has crashed i found this :



FAILED   65.563176] Model: DS-3615xs
m] Failed to sta[   65.563177] set group disks wakeup number to 4, spinup time deno 7
rt Adjust NIC se[   65.563183] synobios cpu_arch proc entry initialized
[   65.563184] synobios crypto_hw proc entry initialized
[   65.563187] synobios syno_platform proc entry initialized
quence.
See "sy[   65.563190] <redpill/bios_shims_collection.c:51> Will print 416 bytes of memory from ffffffffa0986ca0
[   65.563199] 20 6f 98 a0 ff ff ff ff  [00] 0x000      ffffffffa0986f20        __this_module+0x0/0xffffffffffff8e13 [bromolow_synobios]
stemctl status S[   65.563208] c0 e7 97 a0 ff ff ff ff  [01] 0x008      ffffffffa097e7c0        GetBrand+0x0/0x10 [bromolow_synobios]
[   65.563212] c0 f0 97 a0 ff ff ff ff  [02] 0x010      ffffffffa097f0c0        GetModel+0x0/0x260 [bromolow_synobios]
[   65.563215] 00 00 00 00 00 00 00 00  [03] 0x018                (null)                  (null)
[   65.563219] d0 e1 97 a0 ff ff ff ff  [04] 0x020      ffffffffa097e1d0        rtc_bandon_get_time+0x0/0x1a0 [bromolow_synobios]
[   65.563224] 20 e4 97 a0 ff ff ff ff  [05] 0x028      ffffffffa097e420        rtc_bandon_set_time+0x0/0x370 [bromolow_synobios]
[   65.563226] 00 00 00 00 00 00 00 00  [06] 0x030                (null)                  (null)
[   65.563240] f0 b0 00 a0 ff ff ff ff  [07] 0x038      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563245] e0 ef 97 a0 ff ff ff ff  [08] 0x040      ffffffffa097efe0        GetSysTemperature+0x0/0x20 [bromolow_synobios]
[   65.563248] 80 f5 97 a0 ff ff ff ff  [09] 0x048      ffffffffa097f580        GetCpuTemperatureDenlowI3Transfer+0x0/0x80 [bromolow_synobios]
[   65.563253] 30 f5 97 a0 ff ff ff ff  [10] 0x050      ffffffffa097f530        SetDiskLedStatusBy9235GPIOandAHCISGPIO+0x0/0x50 [bromolow_synobios]
[   65.563257] f0 b0 00 a0 ff ff ff ff  [11] 0x058      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563259] 00 00 00 00 00 00 00 00  [12] 0x060                (null)                  (null)
[   65.563262] 00 00 00 00 00 00 00 00  [13] 0x068                (null)                  (null)
[   65.563264] 00 00 00 00 00 00 00 00  [14] 0x070                (null)                  (null)
[   65.563268] f0 b0 00 a0 ff ff ff ff  [15] 0x078      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563273] 00 b1 00 a0 ff ff ff ff  [16] 0x080      ffffffffa000b100        shim_get_gpio_pin_usable+0x0/0x10 [redpill]
[   65.563278] e0 b3 00 a0 ff ff ff ff  [17] 0x088      ffffffffa000b3e0        shim_null_zero_ulong_trace+0x0/0x10 [redpill]
[   65.563282] 70 e3 97 a0 ff ff ff ff  [18] 0x090      ffffffffa097e370        rtc_bandon_set_auto_poweron+0x0/0xb0 [bromolow_synobios]
[   65.563286] 20 df 97 a0 ff ff ff ff  [19] 0x098      ffffffffa097df20        rtc_get_auto_poweron+0x0/0x50 [bromolow_synobios]
[   65.563289] 00 00 00 00 00 00 00 00  [20] 0x0a0                (null)                  (null)
ynoInitEth.servi[   65.563290] 00 00 00 00 00 00 00 00  [21] 0x0a8                (null)                  (null)
[   65.563295] f0 b0 00 a0 ff ff ff ff  [22] 0x0b0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563300] f0 b0 00 a0 ff ff ff ff  [23] 0x0b8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563304] f0 b0 00 a0 ff ff ff ff  [24] 0x0c0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563308] 60 e8 97 a0 ff ff ff ff  [25] 0x0c8      ffffffffa097e860        GetPowerStatus+0x0/0x30 [bromolow_synobios]
[   65.563310] 00 00 00 00 00 00 00 00  [26] 0x0d0                (null)                  (null)
[   65.563314] d0 e7 97 a0 ff ff ff ff  [27] 0x0d8      ffffffffa097e7d0        InitModuleType+0x0/0x30 [bromolow_synobios]
[   65.563318] d0 ed 97 a0 ff ff ff ff  [28] 0x0e0      ffffffffa097edd0        Uninitialize+0x0/0x20 [bromolow_synobios]
[   65.563322] f0 b0 00 a0 ff ff ff ff  [29] 0x0e8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563327] f0 b0 00 a0 ff ff ff ff  [30] 0x0f0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563331] f0 b0 00 a0 ff ff ff ff  [31] 0x0f8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563334] 00 00 00 00 00 00 00 00  [32] 0x100                (null)                  (null)
[   65.563338] f0 b0 00 a0 ff ff ff ff  [33] 0x108      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563342] f0 b0 00 a0 ff ff ff ff  [34] 0x110      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563345] 00 00 00 00 00 00 00 00  [35] 0x118                (null)                  (null)
[   65.563347] 00 00 00 00 00 00 00 00  [36] 0x120                (null)                  (null)
[   65.563349] 00 00 00 00 00 00 00 00  [37] 0x128                (null)                  (null)
[   65.563351] 00 00 00 00 00 00 00 00  [38] 0x130                (null)                  (null)
[   65.563353] 00 00 00 00 00 00 00 00  [39] 0x138                (null)                  (null)
[   65.563357] 30 e9 97 a0 ff ff ff ff  [40] 0x140      ffffffffa097e930        GetCPUInfo+0x0/0x70 [bromolow_synobios]
[   65.563359] 00 00 00 00 00 00 00 00  [41] 0x148                (null)                  (null)
[   65.563361] 00 00 00 00 00 00 00 00  [42] 0x150                (null)                  (null)
[   65.563365] 70 ed 97 a0 ff ff ff ff  [43] 0x158      ffffffffa097ed70        HWMONGetFanSpeedRPMFromADT+0x0/0x60 [bromolow_synobios]
ce" for details.[   65.563367] 00 00 00 00 00 00 00 00  [44] 0x160                (null)                  (null)
[   65.563371] 10 ed 97 a0 ff ff ff ff  [45] 0x168      ffffffffa097ed10        HWMONGetVoltageSensorFromADT+0x0/0x60 [bromolow_synobios]
[   65.563373] 00 00 00 00 00 00 00 00  [46] 0x170                (null)                  (null)
[   65.563376] b0 ec 97 a0 ff ff ff ff  [47] 0x178      ffffffffa097ecb0        HWMONGetThermalSensorFromADT+0x0/0x60 [bromolow_synobios]
[   65.563549] 00 00 00 00 00 00 00 00  [48] 0x180                (null)                  (null)
[   65.563551] 00 00 00 00 00 00 00 00  [49] 0x188                (null)                  (null)
[   65.563552] 00 00 00 00 00 00 00 00  [50] 0x190                (null)                  (null)
[   65.563554] 00 00 00 00 00 00 00 00  [51] 0x198                (null)                  (null)
[   65.563555]
[   65.563555] <redpill/bios_shims_collection.c:64> Finished printing memory at ffffffffa0986e40

[   65.563559] <redpill/bios_shims_collection.c:34> mfgBIOS vtable [6] originally           (null)<          (null)> will now be shim_null_zero_ulong [redpill]<ffffffffa000b0f0>
[   65.563564] <redpill/bios_shims_collection.c:34> mfgBIOS vtable [10] originally SetDiskLedStatusBy9235GPIOandAHCISGPIO [bromolow_synobios]<ffffffffa097f530> will now be shim_null_zero_ulong [redpill]<ffffffffa000b0f0>
[   65.563566] <redpill/bios_shims_collection.c:109> Native RTC supported - not enabling proxy (emulate_rtc=0)
[   65.563566] <redpill/bios_shims_collection.c:51> Will print 416 bytes of memory from ffffffffa0986ca0
[   65.563571] 20 6f 98 a0 ff ff ff ff  [00] 0x000      ffffffffa0986f20        __this_module+0x0/0xffffffffffff8e13 [bromolow_synobios]
[   65.563575] c0 e7 97 a0 ff ff ff ff  [01] 0x008      ffffffffa097e7c0        GetBrand+0x0/0x10 [bromolow_synobios]
[   65.563578] c0 f0 97 a0 ff ff ff ff  [02] 0x010      ffffffffa097f0c0        GetModel+0x0/0x260 [bromolow_synobios]
[   65.563580] 00 00 00 00 00 00 00 00  [03] 0x018                (null)                  (null)
[   65.563585] d0 e1 97 a0 ff ff ff ff  [04] 0x020      ffffffffa097e1d0        rtc_bandon_get_time+0x0/0x1a0 [bromolow_synobios]
[   65.563588] 20 e4 97 a0 ff ff ff ff  [05] 0x028      ffffffffa097e420        rtc_bandon_set_time+0x0/0x370 [bromolow_synobios]
[   65.563593] f0 b0 00 a0 ff ff ff ff  [06] 0x030      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563597] f0 b0 00 a0 ff ff ff ff  [07] 0x038      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563601] e0 ef 97 a0 ff ff ff ff  [08] 0x040      ffffffffa097efe0        GetSysTemperature+0x0/0x20 [bromolow_synobios]
[   65.563605] 80 f5 97 a0 ff ff ff ff  [09] 0x048      ffffffffa097f580        GetCpuTemperatureDenlowI3Transfer+0x0/0x80 [bromolow_synobios]
[   65.563610] f0 b0 00 a0 ff ff ff ff  [10] 0x050      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563614] f0 b0 00 a0 ff ff ff ff  [11] 0x058      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563617] 00 00 00 00 00 00 00 00  [12] 0x060                (null)                  (null)
[   65.563619] 00 00 00 00 00 00 00 00  [13] 0x068                (null)                  (null)
[   65.563621] 00 00 00 00 00 00 00 00  [14] 0x070                (null)                  (null)
[   65.563625] f0 b0 00 a0 ff ff ff ff  [15] 0x078      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563630] 00 b1 00 a0 ff ff ff ff  [16] 0x080      ffffffffa000b100        shim_get_gpio_pin_usable+0x0/0x10 [redpill]
[   65.563634] e0 b3 00 a0 ff ff ff ff  [17] 0x088      ffffffffa000b3e0        shim_null_zero_ulong_trace+0x0/0x10 [redpill]
[   65.563638] 70 e3 97 a0 ff ff ff ff  [18] 0x090      ffffffffa097e370        rtc_bandon_set_auto_poweron+0x0/0xb0 [bromolow_synobios]
[   65.563642] 20 df 97 a0 ff ff ff ff  [19] 0x098      ffffffffa097df20        rtc_get_auto_poweron+0x0/0x50 [bromolow_synobios]
[   65.563644] 00 00 00 00 00 00 00 00  [20] 0x0a0                (null)                  (null)
[   65.563646] 00 00 00 00 00 00 00 00  [21] 0x0a8                (null)                  (null)
[   65.563651] f0 b0 00 a0 ff ff ff ff  [22] 0x0b0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563655] f0 b0 00 a0 ff ff ff ff  [23] 0x0b8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563660] f0 b0 00 a0 ff ff ff ff  [24] 0x0c0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563663] 60 e8 97 a0 ff ff ff ff  [25] 0x0c8      ffffffffa097e860        GetPowerStatus+0x0/0x30 [bromolow_synobios]
[   65.563666] 00 00 00 00 00 00 00 00  [26] 0x0d0                (null)                  (null)
[   65.563669] d0 e7 97 a0 ff ff ff ff  [27] 0x0d8      ffffffffa097e7d0        InitModuleType+0x0/0x30 [bromolow_synobios]
[   65.563673] d0 ed 97 a0 ff ff ff ff  [28] 0x0e0      ffffffffa097edd0        Uninitialize+0x0/0x20 [bromolow_synobios]
[   65.563678] f0 b0 00 a0 ff ff ff ff  [29] 0x0e8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563682] f0 b0 00 a0 ff ff ff ff  [30] 0x0f0      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563687] f0 b0 00 a0 ff ff ff ff  [31] 0x0f8      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563689] 00 00 00 00 00 00 00 00  [32] 0x100                (null)                  (null)
[   65.563693] f0 b0 00 a0 ff ff ff ff  [33] 0x108      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563698] f0 b0 00 a0 ff ff ff ff  [34] 0x110      ffffffffa000b0f0        shim_null_zero_ulong+0x0/0x10 [redpill]
[   65.563700] 00 00 00 00 00 00 00 00  [35] 0x118                (null)                  (null)
[   65.563702] 00 00 00 00 00 00 00 00  [36] 0x120                (null)                  (null)
[   65.563704] 00 00 00 00 00 00 00 00  [37] 0x128                (null)                  (null)
[   65.563706] 00 00 00 00 00 00 00 00  [38] 0x130                (null)                  (null)
[   65.563708] 00 00 00 00 00 00 00 00  [39] 0x138                (null)                  (null)
[   65.563712] 30 e9 97 a0 ff ff ff ff  [40] 0x140      ffffffffa097e930        GetCPUInfo+0x0/0x70 [bromolow_synobios]
[   65.563714] 00 00 00 00 00 00 00 00  [41] 0x148                (null)                  (null)
[   65.563716] 00 00 00 00 00 00 00 00  [42] 0x150                (null)                  (null)
[   65.563720] 70 ed 97 a0 ff ff ff ff  [43] 0x158      ffffffffa097ed70        HWMONGetFanSpeedRPMFromADT+0x0/0x60 [bromolow_synobios]
[   65.563722] 00 00 00 00 00 00 00 00  [44] 0x160                (null)                  (null)
[   65.563726] 10 ed 97 a0 ff ff ff ff  [45] 0x168      ffffffffa097ed10        HWMONGetVoltageSensorFromADT+0x0/0x60 [bromolow_synobios]
[   65.563728] 00 00 00 00 00 00 00 00  [46] 0x170                (null)                  (null)
[   65.563732] b0 ec 97 a0 ff ff ff ff  [47] 0x178      ffffffffa097ecb0        HWMONGetThermalSensorFromADT+0x0/0x60 [bromolow_synobios]
[   65.563734] 00 00 00 00 00 00 00 00  [48] 0x180                (null)                  (null)
[   65.563736] 00 00 00 00 00 00 00 00  [49] 0x188                (null)                  (null)
[   65.563738] 00 00 00 00 00 00 00 00  [50] 0x190                (null)                  (null)
[   65.563740] 00 00 00 00 00 00 00 00  [51] 0x198                (null)                  (null)
[   65.563740]
[   65.563741] <redpill/bios_shims_collection.c:64> Finished printing memory at ffffffffa0986e40
[   65.563741] <redpill/bios_shim.c:124> bromolow_synobios BIOS *fully* shimmed


 

Edited by nemesis122
Link to post
Share on other sites

Yes actually, system is still unstable...

Lot of logs like this :

 

Quote

[ 5507.900018] INFO: task kworker/u4:1:22566 blocked for more than 120 seconds.
[ 5507.901167] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 5507.902445] kworker/u4:1    D ffff88013dd12f40     0 22566      2 0x00000000
[ 5507.903661] Workqueue: btrfs-worker btrfs_worker_helper [btrfs]
[ 5507.904657]  ffff880102e5fa90 0000000000000046 000000000000c000 ffff880102e5ffd8
[ 5507.905955]  ffff880102e5ffd8 ffff8801353de040 0000000000000000 ffff880102e5fa90
[ 5507.907251]  ffffffff8105f934 ffff880137deb800 ffff880137deb800 0000000000000040
[ 5507.908564] Call Trace:
[ 5507.908978]  [<ffffffff8105f934>] ? __wake_up+0x34/0x50
[ 5507.909836]  [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190
[ 5507.910789]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5507.911802]  [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30
[ 5507.912714]  [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno]
[ 5507.914109]  [<ffffffff811211dd>] ? cache_alloc_refill+0x6d/0x6a0
[ 5507.915103]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5507.916141]  [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0
[ 5507.917110]  [<ffffffff813c66b1>] ? md_make_request+0x251/0x460
[ 5507.918082]  [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0
[ 5507.919103]  [<ffffffff81270827>] ? submit_bio+0x77/0x190
[ 5507.919991]  [<ffffffffa0214150>] ? flashcache_merge+0x40/0xd0 [flashcache_syno]
[ 5507.921196]  [<ffffffffa01da44d>] ? dm_merge_bvec+0xcd/0xf0 [dm_mod]
[ 5507.922224]  [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs]
[ 5507.923301]  [<ffffffffa0a6360e>] ? btrfs_csum_one_bio+0x2ee/0x330 [btrfs]
[ 5507.924450]  [<ffffffffa0a74df7>] ? __btrfs_submit_bio_done+0x17/0x40 [btrfs]
[ 5507.925618]  [<ffffffffa0a65d43>] ? run_one_async_start+0x23/0x30 [btrfs]
[ 5507.926726]  [<ffffffffa0aa93e7>] ? btrfs_worker_helper+0x127/0x3b0 [btrfs]
[ 5507.927866]  [<ffffffff8105f553>] ? worker_run_work+0xa3/0xf0
[ 5507.928814]  [<ffffffffa0aa92c0>] ? __btrfs_alloc_workqueue+0x160/0x160 [btrfs]
[ 5507.930003]  [<ffffffff81050fce>] ? process_one_work+0x14e/0x520
[ 5507.930985]  [<ffffffff81051d98>] ? worker_thread+0x108/0x420
[ 5507.931928]  [<ffffffff81051c90>] ? manage_workers.isra.30+0x260/0x260
[ 5507.932991]  [<ffffffff81057b01>] ? kthread+0xb1/0xc0
[ 5507.933829]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160
[ 5507.934823]  [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0
[ 5507.935736]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160
[ 5627.839951] INFO: task btrfs-transacti:8021 blocked for more than 120 seconds.
[ 5627.841248] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 5627.842544] btrfs-transacti D ffff88013dd12f40     0  8021      2 0x00000000
[ 5627.843763]  ffff88012800b790 0000000000000046 000000000000c000 ffff88012800bfd8
[ 5627.845084]  ffff88012800bfd8 ffff880134d34040 ffff8801248a8e00 ffff88013338f400
[ 5627.846401]  0000000002d40000 ffff880137deb800 ffff880137deb800 0000000000000080
[ 5627.847712] Call Trace:
[ 5627.848139]  [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190
[ 5627.849096]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5627.850149]  [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30
[ 5627.851116]  [<ffffffffa0acb3a5>] ? add_delayed_tree_ref.isra.15+0xd5/0x1a0 [btrfs]
[ 5627.852425]  [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno]
[ 5627.853846]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5627.854900]  [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0
[ 5627.855888]  [<ffffffff813c66b1>] ? md_make_request+0x251/0x460
[ 5627.856871]  [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0
[ 5627.857908]  [<ffffffff81270827>] ? submit_bio+0x77/0x190
[ 5627.858800]  [<ffffffff8116963c>] ? __bio_clone+0x1c/0xa0
[ 5627.859695]  [<ffffffff8116ad08>] ? bio_clone_bioset+0x28/0x60
[ 5627.860671]  [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs]
[ 5627.861788]  [<ffffffffa0a69408>] ? btree_submit_bio_hook+0xa8/0x160 [btrfs]
[ 5627.862974]  [<ffffffffa0a9328f>] ? submit_one_bio+0x5f/0x90 [btrfs]
[ 5627.864036]  [<ffffffffa0a96dc3>] ? submit_extent_page.constprop.52+0x123/0x290 [btrfs]
[ 5627.865363]  [<ffffffffa0a933b0>] ? end_extent_buffer_writeback+0x20/0x20 [btrfs]
[ 5627.866602]  [<ffffffffa0a9cca3>] ? write_one_eb+0x1a3/0x2e0 [btrfs]
[ 5627.867663]  [<ffffffffa0a933b0>] ? end_extent_buffer_writeback+0x20/0x20 [btrfs]
[ 5627.868908]  [<ffffffffa0a9cfab>] ? btree_write_cache_pages+0x1cb/0x3a0 [btrfs]
[ 5627.870117]  [<ffffffff810ee052>] ? do_writepages+0x22/0x80
[ 5627.871042]  [<ffffffff810e3a49>] ? __filemap_fdatawrite_range+0x49/0x50
[ 5627.872154]  [<ffffffffa0a71973>] ? btrfs_write_marked_extents+0xc3/0x190 [btrfs]
[ 5627.873398]  [<ffffffffa0a7253c>] ? btrfs_commit_transaction+0x7cc/0xcc0 [btrfs]
[ 5627.874624]  [<ffffffffa0a6ce3d>] ? transaction_kthread+0x26d/0x2e0 [btrfs]
[ 5627.875778]  [<ffffffffa0a6cbd0>] ? btrfs_cleanup_transaction+0x5c0/0x5c0 [btrfs]
[ 5627.877010]  [<ffffffff81057b01>] ? kthread+0xb1/0xc0
[ 5627.877845]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160
[ 5627.878857]  [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0
[ 5627.879785]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160
[ 5627.880844] INFO: task kworker/u4:1:22566 blocked for more than 120 seconds.
[ 5627.882030] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 5627.883353] kworker/u4:1    D ffff88013dd12f40     0 22566      2 0x00000000
[ 5627.884584] Workqueue: btrfs-worker btrfs_worker_helper [btrfs]
[ 5627.885591]  ffff880102e5fa90 0000000000000046 000000000000c000 ffff880102e5ffd8
[ 5627.886927]  ffff880102e5ffd8 ffff8801353de040 0000000000000000 ffff880102e5fa90
[ 5627.888243]  ffffffff8105f934 ffff880137deb800 ffff880137deb800 0000000000000040
[ 5627.889563] Call Trace:
[ 5627.889987]  [<ffffffff8105f934>] ? __wake_up+0x34/0x50
[ 5627.890848]  [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190
[ 5627.891797]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5627.892843]  [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30
[ 5627.893774]  [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno]
[ 5627.895178]  [<ffffffff811211dd>] ? cache_alloc_refill+0x6d/0x6a0
[ 5627.896179]  [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60
[ 5627.897224]  [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0
[ 5627.898200]  [<ffffffff813c66b1>] ? md_make_request+0x251/0x460
[ 5627.899174]  [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0
[ 5627.900204]  [<ffffffff81270827>] ? submit_bio+0x77/0x190
[ 5627.901104]  [<ffffffffa0214150>] ? flashcache_merge+0x40/0xd0 [flashcache_syno]
[ 5627.902320]  [<ffffffffa01da44d>] ? dm_merge_bvec+0xcd/0xf0 [dm_mod]
[ 5627.903372]  [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs]
[ 5627.904478]  [<ffffffffa0a6360e>] ? btrfs_csum_one_bio+0x2ee/0x330 [btrfs]
[ 5627.905621]  [<ffffffffa0a74df7>] ? __btrfs_submit_bio_done+0x17/0x40 [btrfs]
[ 5627.906805]  [<ffffffffa0a65d43>] ? run_one_async_start+0x23/0x30 [btrfs]
[ 5627.908116]  [<ffffffffa0aa93e7>] ? btrfs_worker_helper+0x127/0x3b0 [btrfs]
[ 5627.909261]  [<ffffffff8105f553>] ? worker_run_work+0xa3/0xf0
[ 5627.910219]  [<ffffffffa0aa92c0>] ? __btrfs_alloc_workqueue+0x160/0x160 [btrfs]
[ 5627.911426]  [<ffffffff81050fce>] ? process_one_work+0x14e/0x520
[ 5627.912420]  [<ffffffff81051d98>] ? worker_thread+0x108/0x420
[ 5627.913369]  [<ffffffff81051c90>] ? manage_workers.isra.30+0x260/0x260
[ 5627.914442]  [<ffffffff81057b01>] ? kthread+0xb1/0xc0
[ 5627.915284]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160
[ 5627.916288]  [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0
[ 5627.917208]  [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160

Edit :

CPU also does not like at all the docker container running :

image.thumb.png.ba0f69aeb42beabe9c613a011e6500b5.png

 

I had to reset VM.

Edited by Orphée
Link to post
Share on other sites
1 hour ago, altas said:

is this Docker Crashes also in 6.2.4 or only in DSM7 ?

 

maybe docker has some problems with DSM7 :)

 

i will do a test today evening with my fresh Gen8 on DSM7 and a Docker Image.

I saw these crashes when I first migrated to 6.2.4 and they persisted into 7.0.1-RC1

Link to post
Share on other sites
34 minutes ago, nemesis122 said:

[  817.217981] CPU: 0 PID: 2187 Comm: runc Tainted: PF        C O 3.10.108 #42214
[  817.253128] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 11/02/2015
[  817.287675]  ffffffff814a2759 ffffffff814a16b1 0000000000000010 ffff880309a08d60
[  817.324215]  ffff880309a08cf8 0000000000000000 0000000000000000 0000000000000001
[  817.360571]  0000000000000000 ffffffff80000001 0000000000000030 ffff8802f9a1d000
[  817.396778] Call Trace:
[  817.408374]  <NMI>  [<ffffffff814a2759>] ? dump_stack+0xc/0x15
[  817.436964]  [<ffffffff814a16b1>] ? panic+0xbb/0x1df
[  817.461381]  [<ffffffff810a9eb8>] ? watchdog_overflow_callback+0xa8/0xb0
[  817.494556]  [<ffffffff810db7d3>] ? __perf_event_overflow+0x93/0x230
[  817.525829]  [<ffffffff810da612>] ? perf_event_update_userpage+0x12/0xf0
[  817.558713]  [<ffffffff810152a4>] ? intel_pmu_handle_irq+0x1b4/0x340
[  817.590098]  [<ffffffff814a9d06>] ? perf_event_nmi_handler+0x26/0x40
[  817.621060]  [<ffffffff814a944e>] ? do_nmi+0xfe/0x440
[  817.645848]  [<ffffffff814a8a53>] ? end_repeat_nmi+0x1e/0x7e
[  817.673689]  <<EOE>>
[  817.684069] Rebooting in 3 seconds..

Thanks for grabbing the console output 👍

 

This actually looks a little different to what I've been seeing (crashes with containerd-shim). Yours is indicating a problem with 'runs', and seems related to a thread I found while searching this morning:

https://github.com/opencontainers/runc/issues/2530

 

This still manifests as problems with containerd. Some people are seeing reboots, some are simply seeing lockups.

 

You really ought to update the BIOS on your Gen8, by the way ;), and check out whatever other firmware updates are available for it...

  • Haha 1
Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.