pocopico Posted September 17, 2021 Share #1651 Posted September 17, 2021 (edited) 9 hours ago, erkify said: Thanks for the work, I have used it on my baremetal G8 microserver and I have been able to update everything, but the system reboots after about 20 minutes and reboots again, any ideas? Thank you If you are using docker, after couple of minutes it will stop responding. I have verified that even with a single container set to Low and 512MB, after some time it will stop responding. It is strange that it doesnt crash, it just hangs. Not able to see whats happening in the console as i dont have a serial port on my Gen7 that im using for tests. Edited September 17, 2021 by pocopico Link to comment Share on other sites More sharing options...
nemesis122 Posted September 17, 2021 Share #1652 Posted September 17, 2021 (edited) it is working i will check the log is there option to save the serial output automatic ? Edited September 17, 2021 by nemesis122 Link to comment Share on other sites More sharing options...
nemesis122 Posted September 17, 2021 Share #1653 Posted September 17, 2021 this console output makes me ....😆 Link to comment Share on other sites More sharing options...
mcdull Posted September 17, 2021 Share #1654 Posted September 17, 2021 I guess we should open another thread for helping others to use the loader and to leave this thread for development needs. now this thread is flooding with operational issue. 3 1 Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1655 Posted September 17, 2021 1 hour ago, Orphée said: "synoinfo": { "supportsystemperature": "no", "supportsystempwarning": "no" } When I do this (or the command line equivalent previously mentioned), it fixes the empty General tab problem, but also seems to stop the system from polling the 'System' temperature, as I'm now getting alerts from my SNMP monitoring platform (LibreNMS) that the 'System' temperature is below the configured minimum: It seems the DSM is polling the temperature properly from the hardware sensors (as it was exposing them via SNMP, and the measurements match what I can see on the iLO), but it's apparently not parsing the value correctly, leading to the blank tab in General. On balance I'd rather have the correct data in my monitoring platform than the General information page in Control Panel, so I'll leave put it back as it was now, with 'supportsystemperature' and 'supportsystempwarning' set to 'yes'. Link to comment Share on other sites More sharing options...
pocopico Posted September 17, 2021 Share #1656 Posted September 17, 2021 (edited) 11 minutes ago, mcdull said: I guess we should open another thread for helping others to use the loader and to leave this thread for development needs. now this thread is flooding with operational issue. You can always open issues on Github for any issues that are addressed to the developers. This thread IMHO, helps people understand the concept, install using the loader on their test systems and that's pushing the development even further. Testing process anyway is a development stage. Edited September 17, 2021 by pocopico 1 1 Link to comment Share on other sites More sharing options...
Orphée Posted September 17, 2021 Share #1657 Posted September 17, 2021 I confirm the system crash once a docker container is running. I installed default nginx container, ran it.... and system froze a few minutes later. had to reset system (ESXi Virtual Machine) Don't know if this is revelant : Link to comment Share on other sites More sharing options...
spv4u1975 Posted September 17, 2021 Share #1658 Posted September 17, 2021 Hey all I've can build fine thanks to the info on here but can someone advise how I build a 918 build it is always the 3615xs that gets created. I know I would love a DVA version build to get AI CCTV running with the 6 licences and Nvidia, Is this on the roadmap/possible? Thanks for al the hard work Link to comment Share on other sites More sharing options...
erkify Posted September 17, 2021 Share #1659 Posted September 17, 2021 With docker deactivated the system is stable in G8 baremetal, I will do this weekend some test with the intel or internal nic to see if it has something to do, I will let the volume be debugged. Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1660 Posted September 17, 2021 (edited) 30 minutes ago, Orphée said: I confirm the system crash once a docker container is running. I installed default nginx container, ran it.... and system froze a few minutes later. had to reset system (ESXi Virtual Machine) Don't know if this is revelant : Interesting that you're seeing this in ESXi, others have only reported it on baremetal so far. Are those messages from ESXi, or the xpenolgy guest? What are you seeing on the serial output from the xpenology guest? FWIW I was seeing output like this each time my system kernel panic'd and rebooted: [ 194.017549] Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 2 [ 194.052575] CPU: 2 PID: 19580 Comm: containerd-shim Tainted: PF O 3.10.105 #25556 [ 194.094270] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 04/04/2019 [ 194.128400] ffffffff814c904d ffffffff814c8121 0000000000000010 ffff880109a88d58 [ 194.164811] ffff880109a88cf0 0000000000000000 0000000000000002 000000000000002b [ 194.201332] 0000000000000002 ffffffff80000001 0000000000000010 ffff880103ee5c00 [ 194.238138] Call Trace: [ 194.250471] <NMI> [<ffffffff814c904d>] ? dump_stack+0xc/0x15 [ 194.279225] [<ffffffff814c8121>] ? panic+0xbb/0x1ce [ 194.304100] [<ffffffff810a0922>] ? watchdog_overflow_callback+0xb2/0xc0 [ 194.337400] [<ffffffff810b152b>] ? __perf_event_overflow+0x8b/0x240 [ 194.368795] [<ffffffff810b02d4>] ? perf_event_update_userpage+0x14/0xf0 [ 194.401338] [<ffffffff81015411>] ? intel_pmu_handle_irq+0x1d1/0x360 [ 194.432957] [<ffffffff81010026>] ? perf_event_nmi_handler+0x26/0x40 [ 194.464708] [<ffffffff81005fa8>] ? do_nmi+0xf8/0x3e0 [ 194.488902] [<ffffffff814cfa53>] ? end_repeat_nmi+0x1e/0x7e [ 194.517219] <<EOE>> [ 195.556746] Shutting down cpus with NMI [ 195.576047] Rebooting in 3 seconds.. Most of the time is was containerd-shim that was named in the messaging, but sometimes it was the specific process within the container, influxdb. It was always my influxdb container causing the crashes, and my system has been very stable since I stopped trying to run it. I have done a lot of searching on the internet and can find no mention of influxdb in docker causing kernel panics, therefore I'm almost certain that this is unique to redpill. I used influxdb for years on xpenology using Jun's bootloader and have only seen this moving to redpill, on 6.2.4 and 7.0.1-RC1. Can we move this to the other topic I started, though? We're cluttering up the thread here... https://xpenology.com/forum/topic/47970-docker-causing-kernel-panics-after-move-to-624-on-redpill/ Edited September 17, 2021 by WiteWulf Link to comment Share on other sites More sharing options...
dodo-dk Posted September 17, 2021 Share #1661 Posted September 17, 2021 Same problem here in Proxmox after some Docker starts and stops. I have deactivated the IPv6 Support in the DSM Network Settings (not in Docker Settings) and it seams to be stable at the moment. 1 Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1662 Posted September 17, 2021 22 minutes ago, spv4u1975 said: I've can build fine thanks to the info on here but can someone advise how I build a 918 build it is always the 3615xs that gets created. I know I would love a DVA version build to get AI CCTV running with the 6 licences and Nvidia, Is this on the roadmap/possible? You need to build the apollolake target to get 918, bromolow builds 3615xs. Also, make sure your hardware (CPU and chipset) is compatible with apollolake. 1 Link to comment Share on other sites More sharing options...
Orphée Posted September 17, 2021 Share #1663 Posted September 17, 2021 (edited) 9 minutes ago, WiteWulf said: Interesting that you're seeing this in ESXi, others have only reported it on baremetal so far. Are those messages from ESXi, or the xpenolgy guest? What are you seeing on the serial output from the xpenology guest? FWIW I was seeing output like this each time my system kernel panic'd and rebooted: [ 194.017549] Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 2 [ 194.052575] CPU: 2 PID: 19580 Comm: containerd-shim Tainted: PF O 3.10.105 #25556 [ 194.094270] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 04/04/2019 [ 194.128400] ffffffff814c904d ffffffff814c8121 0000000000000010 ffff880109a88d58 [ 194.164811] ffff880109a88cf0 0000000000000000 0000000000000002 000000000000002b [ 194.201332] 0000000000000002 ffffffff80000001 0000000000000010 ffff880103ee5c00 [ 194.238138] Call Trace: [ 194.250471] <NMI> [<ffffffff814c904d>] ? dump_stack+0xc/0x15 [ 194.279225] [<ffffffff814c8121>] ? panic+0xbb/0x1ce [ 194.304100] [<ffffffff810a0922>] ? watchdog_overflow_callback+0xb2/0xc0 [ 194.337400] [<ffffffff810b152b>] ? __perf_event_overflow+0x8b/0x240 [ 194.368795] [<ffffffff810b02d4>] ? perf_event_update_userpage+0x14/0xf0 [ 194.401338] [<ffffffff81015411>] ? intel_pmu_handle_irq+0x1d1/0x360 [ 194.432957] [<ffffffff81010026>] ? perf_event_nmi_handler+0x26/0x40 [ 194.464708] [<ffffffff81005fa8>] ? do_nmi+0xf8/0x3e0 [ 194.488902] [<ffffffff814cfa53>] ? end_repeat_nmi+0x1e/0x7e [ 194.517219] <<EOE>> [ 195.556746] Shutting down cpus with NMI [ 195.576047] Rebooting in 3 seconds.. Most of the time is was containerd-shim that was named in the messaging, but sometimes it was the specific process within the container, influxdb. It was always my influxdb container causing the crashes, and my system has been very stable since I stopped trying to run it. I have done a lot of searching on the internet and can find no mention of influxdb in docker causing kernel panics, therefore I'm almost certain that this is unique to redpill. I used influxdb for years on xpenology using Jun's bootloader and have only seen this moving to redpill, on 6.2.4 and 7.0.1-RC1. Can we more this to the other topic I started, though? We're cluttering up the thread here... https://xpenology.com/forum/topic/47970-docker-causing-kernel-panics-after-move-to-624-on-redpill/ Picture from xpenology guest. The hint about IPV6 might be interesting : Quote [ 19.778392] Synotify use 16384 event queue size [ 20.030329] ip6_tables: (C) 2000-2006 Netfilter Core Team [ 20.068195] aufs 3.10.x-20141110 [ 20.076943] Bridge firewalling registered [ 20.086435] cgroup: systemd (1) created nested cgroup for controller "blkio" which has incomplete hierarchy support. Nested cgroups may change behavior in the future. [ 20.646395] <redpill/pmu_shim.c:310> Got 2 bytes from PMU: reason=1 hex={2d 33} ascii="-3" [ 20.647879] <redpill/pmu_shim.c:239> Executing cmd OUT_BUZ_LONG handler cmd_shim_noop+0x0/0x30 [redpill] [ 20.649544] <redpill/pmu_shim.c:45> vPMU received OUT_BUZ_LONG using 1 bytes - NOOP [ 21.143474] fuse init (API version 7.22) [ 21.466668] Initializing XFRM netlink socket [ 21.469339] Netfilter messages via NETLINK v0.30. [ 21.539973] IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready [ 22.646774] <redpill/pmu_shim.c:310> Got 2 bytes from PMU: reason=1 hex={2d 38} ascii="-8" [ 22.648188] <redpill/pmu_shim.c:239> Executing cmd OUT_STATUS_LED_ON_GREEN handler cmd_shim_noop+0x0/0x30 [redpill] [ 22.649863] <redpill/pmu_shim.c:45> vPMU received OUT_STATUS_LED_ON_GREEN using 1 bytes - NOOP [ 33.913043] Synotify use 16384 event queue size [ 34.650817] Synotify use 16384 event queue size [ 63.071479] device docker8d303b8 entered promiscuous mode [ 63.072593] IPv6: ADDRCONF(NETDEV_UP): docker8d303b8: link is not ready [ 63.602699] IPv6: ADDRCONF(NETDEV_CHANGE): docker8d303b8: link becomes ready [ 63.604006] docker0: port 1(docker8d303b8) entered forwarding state [ 63.605095] docker0: port 1(docker8d303b8) entered forwarding state [ 63.606410] IPv6: ADDRCONF(NETDEV_CHANGE): docker0: link becomes ready [ 72.294676] <redpill/smart_shim.c:794> Handling ioctl(0x31f) for /dev/sda [ 72.296079] <redpill/smart_shim.c:624> Got SMART *command* - looking for feature=0xd0 [ 72.297396] <redpill/smart_shim.c:376> Generating fake SMART values [ 78.645421] docker0: port 1(docker8d303b8) entered forwarding state and system froze. Disabling DSM IPV6 seems to fix the issue... I'ill leave container running to confirm Edited September 17, 2021 by Orphée Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1664 Posted September 17, 2021 (edited) IPv6 is disabled in control panel on my system: ...but docker still tries to setup IPv6 networking for some or all containers when they start. IPv6 is also disabled on the docker networks on my setup: Edited September 17, 2021 by WiteWulf Link to comment Share on other sites More sharing options...
pocopico Posted September 17, 2021 Share #1665 Posted September 17, 2021 On my Gen7 i have a strange issue. Whenever i perform shutdown the system halts OK but it doesnt get the signal to power off. How is shutdown in baremetal working for you ? Link to comment Share on other sites More sharing options...
Orphée Posted September 17, 2021 Share #1666 Posted September 17, 2021 5 minutes ago, WiteWulf said: IPv6 is disabled in control panel on my system: ...but docker still tries to setup IPv6 networking for some or all containers when they start. IPv6 is also disabled on the docker networks on my setup: Same for me, but at least IPv6 disabled in DSM fix the crashes. 1 Link to comment Share on other sites More sharing options...
altas Posted September 17, 2021 Share #1667 Posted September 17, 2021 (edited) is this Docker Crashes also in 6.2.4 or only in DSM7 ? maybe docker has some problems with DSM7 i will do a test today evening with my fresh Gen8 on DSM7 and a Docker Image. Edited September 17, 2021 by altas Link to comment Share on other sites More sharing options...
shibby Posted September 17, 2021 Share #1668 Posted September 17, 2021 5 minutes ago, altas said: maybe docker has some problems with DSM7 i have DS218+ and DS918+ with DSM7 and no issue with docker. Link to comment Share on other sites More sharing options...
nemesis122 Posted September 17, 2021 Share #1669 Posted September 17, 2021 lol it is bit strange doing some stupid things reload the login page starting docker jd2 file download smb copy with 300 MB but the server will not crash since i have enabedl the ilo console in putty creating docker etc using the hammer but the server will not crash Link to comment Share on other sites More sharing options...
nemesis122 Posted September 17, 2021 Share #1670 Posted September 17, 2021 (edited) I have disabled IPV 6 in the networkadapter also on Docker but the server still reboot saw this message at least: [ 817.217981] CPU: 0 PID: 2187 Comm: runc Tainted: PF C O 3.10.108 #42214 [ 817.253128] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 11/02/2015 [ 817.287675] ffffffff814a2759 ffffffff814a16b1 0000000000000010 ffff880309a08d60 [ 817.324215] ffff880309a08cf8 0000000000000000 0000000000000000 0000000000000001 [ 817.360571] 0000000000000000 ffffffff80000001 0000000000000030 ffff8802f9a1d000 [ 817.396778] Call Trace: [ 817.408374] <NMI> [<ffffffff814a2759>] ? dump_stack+0xc/0x15 [ 817.436964] [<ffffffff814a16b1>] ? panic+0xbb/0x1df [ 817.461381] [<ffffffff810a9eb8>] ? watchdog_overflow_callback+0xa8/0xb0 [ 817.494556] [<ffffffff810db7d3>] ? __perf_event_overflow+0x93/0x230 [ 817.525829] [<ffffffff810da612>] ? perf_event_update_userpage+0x12/0xf0 [ 817.558713] [<ffffffff810152a4>] ? intel_pmu_handle_irq+0x1b4/0x340 [ 817.590098] [<ffffffff814a9d06>] ? perf_event_nmi_handler+0x26/0x40 [ 817.621060] [<ffffffff814a944e>] ? do_nmi+0xfe/0x440 [ 817.645848] [<ffffffff814a8a53>] ? end_repeat_nmi+0x1e/0x7e [ 817.673689] <<EOE>> [ 817.684069] Rebooting in 3 seconds.. after the reboot: [ 141.234109] IPv6: ADDRCONF(NETDEV_UP): docker0: link is not ready but in DSM IPV 6 is disabled During the Boot process after the server has crashed i found this : FAILED 65.563176] Model: DS-3615xs m] Failed to sta[ 65.563177] set group disks wakeup number to 4, spinup time deno 7 rt Adjust NIC se[ 65.563183] synobios cpu_arch proc entry initialized [ 65.563184] synobios crypto_hw proc entry initialized [ 65.563187] synobios syno_platform proc entry initialized quence. See "sy[ 65.563190] <redpill/bios_shims_collection.c:51> Will print 416 bytes of memory from ffffffffa0986ca0 [ 65.563199] 20 6f 98 a0 ff ff ff ff [00] 0x000 ffffffffa0986f20 __this_module+0x0/0xffffffffffff8e13 [bromolow_synobios] stemctl status S[ 65.563208] c0 e7 97 a0 ff ff ff ff [01] 0x008 ffffffffa097e7c0 GetBrand+0x0/0x10 [bromolow_synobios] [ 65.563212] c0 f0 97 a0 ff ff ff ff [02] 0x010 ffffffffa097f0c0 GetModel+0x0/0x260 [bromolow_synobios] [ 65.563215] 00 00 00 00 00 00 00 00 [03] 0x018 (null) (null) [ 65.563219] d0 e1 97 a0 ff ff ff ff [04] 0x020 ffffffffa097e1d0 rtc_bandon_get_time+0x0/0x1a0 [bromolow_synobios] [ 65.563224] 20 e4 97 a0 ff ff ff ff [05] 0x028 ffffffffa097e420 rtc_bandon_set_time+0x0/0x370 [bromolow_synobios] [ 65.563226] 00 00 00 00 00 00 00 00 [06] 0x030 (null) (null) [ 65.563240] f0 b0 00 a0 ff ff ff ff [07] 0x038 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563245] e0 ef 97 a0 ff ff ff ff [08] 0x040 ffffffffa097efe0 GetSysTemperature+0x0/0x20 [bromolow_synobios] [ 65.563248] 80 f5 97 a0 ff ff ff ff [09] 0x048 ffffffffa097f580 GetCpuTemperatureDenlowI3Transfer+0x0/0x80 [bromolow_synobios] [ 65.563253] 30 f5 97 a0 ff ff ff ff [10] 0x050 ffffffffa097f530 SetDiskLedStatusBy9235GPIOandAHCISGPIO+0x0/0x50 [bromolow_synobios] [ 65.563257] f0 b0 00 a0 ff ff ff ff [11] 0x058 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563259] 00 00 00 00 00 00 00 00 [12] 0x060 (null) (null) [ 65.563262] 00 00 00 00 00 00 00 00 [13] 0x068 (null) (null) [ 65.563264] 00 00 00 00 00 00 00 00 [14] 0x070 (null) (null) [ 65.563268] f0 b0 00 a0 ff ff ff ff [15] 0x078 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563273] 00 b1 00 a0 ff ff ff ff [16] 0x080 ffffffffa000b100 shim_get_gpio_pin_usable+0x0/0x10 [redpill] [ 65.563278] e0 b3 00 a0 ff ff ff ff [17] 0x088 ffffffffa000b3e0 shim_null_zero_ulong_trace+0x0/0x10 [redpill] [ 65.563282] 70 e3 97 a0 ff ff ff ff [18] 0x090 ffffffffa097e370 rtc_bandon_set_auto_poweron+0x0/0xb0 [bromolow_synobios] [ 65.563286] 20 df 97 a0 ff ff ff ff [19] 0x098 ffffffffa097df20 rtc_get_auto_poweron+0x0/0x50 [bromolow_synobios] [ 65.563289] 00 00 00 00 00 00 00 00 [20] 0x0a0 (null) (null) ynoInitEth.servi[ 65.563290] 00 00 00 00 00 00 00 00 [21] 0x0a8 (null) (null) [ 65.563295] f0 b0 00 a0 ff ff ff ff [22] 0x0b0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563300] f0 b0 00 a0 ff ff ff ff [23] 0x0b8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563304] f0 b0 00 a0 ff ff ff ff [24] 0x0c0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563308] 60 e8 97 a0 ff ff ff ff [25] 0x0c8 ffffffffa097e860 GetPowerStatus+0x0/0x30 [bromolow_synobios] [ 65.563310] 00 00 00 00 00 00 00 00 [26] 0x0d0 (null) (null) [ 65.563314] d0 e7 97 a0 ff ff ff ff [27] 0x0d8 ffffffffa097e7d0 InitModuleType+0x0/0x30 [bromolow_synobios] [ 65.563318] d0 ed 97 a0 ff ff ff ff [28] 0x0e0 ffffffffa097edd0 Uninitialize+0x0/0x20 [bromolow_synobios] [ 65.563322] f0 b0 00 a0 ff ff ff ff [29] 0x0e8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563327] f0 b0 00 a0 ff ff ff ff [30] 0x0f0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563331] f0 b0 00 a0 ff ff ff ff [31] 0x0f8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563334] 00 00 00 00 00 00 00 00 [32] 0x100 (null) (null) [ 65.563338] f0 b0 00 a0 ff ff ff ff [33] 0x108 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563342] f0 b0 00 a0 ff ff ff ff [34] 0x110 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563345] 00 00 00 00 00 00 00 00 [35] 0x118 (null) (null) [ 65.563347] 00 00 00 00 00 00 00 00 [36] 0x120 (null) (null) [ 65.563349] 00 00 00 00 00 00 00 00 [37] 0x128 (null) (null) [ 65.563351] 00 00 00 00 00 00 00 00 [38] 0x130 (null) (null) [ 65.563353] 00 00 00 00 00 00 00 00 [39] 0x138 (null) (null) [ 65.563357] 30 e9 97 a0 ff ff ff ff [40] 0x140 ffffffffa097e930 GetCPUInfo+0x0/0x70 [bromolow_synobios] [ 65.563359] 00 00 00 00 00 00 00 00 [41] 0x148 (null) (null) [ 65.563361] 00 00 00 00 00 00 00 00 [42] 0x150 (null) (null) [ 65.563365] 70 ed 97 a0 ff ff ff ff [43] 0x158 ffffffffa097ed70 HWMONGetFanSpeedRPMFromADT+0x0/0x60 [bromolow_synobios] ce" for details.[ 65.563367] 00 00 00 00 00 00 00 00 [44] 0x160 (null) (null) [ 65.563371] 10 ed 97 a0 ff ff ff ff [45] 0x168 ffffffffa097ed10 HWMONGetVoltageSensorFromADT+0x0/0x60 [bromolow_synobios] [ 65.563373] 00 00 00 00 00 00 00 00 [46] 0x170 (null) (null) [ 65.563376] b0 ec 97 a0 ff ff ff ff [47] 0x178 ffffffffa097ecb0 HWMONGetThermalSensorFromADT+0x0/0x60 [bromolow_synobios] [ 65.563549] 00 00 00 00 00 00 00 00 [48] 0x180 (null) (null) [ 65.563551] 00 00 00 00 00 00 00 00 [49] 0x188 (null) (null) [ 65.563552] 00 00 00 00 00 00 00 00 [50] 0x190 (null) (null) [ 65.563554] 00 00 00 00 00 00 00 00 [51] 0x198 (null) (null) [ 65.563555] [ 65.563555] <redpill/bios_shims_collection.c:64> Finished printing memory at ffffffffa0986e40 [ 65.563559] <redpill/bios_shims_collection.c:34> mfgBIOS vtable [6] originally (null)< (null)> will now be shim_null_zero_ulong [redpill]<ffffffffa000b0f0> [ 65.563564] <redpill/bios_shims_collection.c:34> mfgBIOS vtable [10] originally SetDiskLedStatusBy9235GPIOandAHCISGPIO [bromolow_synobios]<ffffffffa097f530> will now be shim_null_zero_ulong [redpill]<ffffffffa000b0f0> [ 65.563566] <redpill/bios_shims_collection.c:109> Native RTC supported - not enabling proxy (emulate_rtc=0) [ 65.563566] <redpill/bios_shims_collection.c:51> Will print 416 bytes of memory from ffffffffa0986ca0 [ 65.563571] 20 6f 98 a0 ff ff ff ff [00] 0x000 ffffffffa0986f20 __this_module+0x0/0xffffffffffff8e13 [bromolow_synobios] [ 65.563575] c0 e7 97 a0 ff ff ff ff [01] 0x008 ffffffffa097e7c0 GetBrand+0x0/0x10 [bromolow_synobios] [ 65.563578] c0 f0 97 a0 ff ff ff ff [02] 0x010 ffffffffa097f0c0 GetModel+0x0/0x260 [bromolow_synobios] [ 65.563580] 00 00 00 00 00 00 00 00 [03] 0x018 (null) (null) [ 65.563585] d0 e1 97 a0 ff ff ff ff [04] 0x020 ffffffffa097e1d0 rtc_bandon_get_time+0x0/0x1a0 [bromolow_synobios] [ 65.563588] 20 e4 97 a0 ff ff ff ff [05] 0x028 ffffffffa097e420 rtc_bandon_set_time+0x0/0x370 [bromolow_synobios] [ 65.563593] f0 b0 00 a0 ff ff ff ff [06] 0x030 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563597] f0 b0 00 a0 ff ff ff ff [07] 0x038 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563601] e0 ef 97 a0 ff ff ff ff [08] 0x040 ffffffffa097efe0 GetSysTemperature+0x0/0x20 [bromolow_synobios] [ 65.563605] 80 f5 97 a0 ff ff ff ff [09] 0x048 ffffffffa097f580 GetCpuTemperatureDenlowI3Transfer+0x0/0x80 [bromolow_synobios] [ 65.563610] f0 b0 00 a0 ff ff ff ff [10] 0x050 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563614] f0 b0 00 a0 ff ff ff ff [11] 0x058 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563617] 00 00 00 00 00 00 00 00 [12] 0x060 (null) (null) [ 65.563619] 00 00 00 00 00 00 00 00 [13] 0x068 (null) (null) [ 65.563621] 00 00 00 00 00 00 00 00 [14] 0x070 (null) (null) [ 65.563625] f0 b0 00 a0 ff ff ff ff [15] 0x078 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563630] 00 b1 00 a0 ff ff ff ff [16] 0x080 ffffffffa000b100 shim_get_gpio_pin_usable+0x0/0x10 [redpill] [ 65.563634] e0 b3 00 a0 ff ff ff ff [17] 0x088 ffffffffa000b3e0 shim_null_zero_ulong_trace+0x0/0x10 [redpill] [ 65.563638] 70 e3 97 a0 ff ff ff ff [18] 0x090 ffffffffa097e370 rtc_bandon_set_auto_poweron+0x0/0xb0 [bromolow_synobios] [ 65.563642] 20 df 97 a0 ff ff ff ff [19] 0x098 ffffffffa097df20 rtc_get_auto_poweron+0x0/0x50 [bromolow_synobios] [ 65.563644] 00 00 00 00 00 00 00 00 [20] 0x0a0 (null) (null) [ 65.563646] 00 00 00 00 00 00 00 00 [21] 0x0a8 (null) (null) [ 65.563651] f0 b0 00 a0 ff ff ff ff [22] 0x0b0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563655] f0 b0 00 a0 ff ff ff ff [23] 0x0b8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563660] f0 b0 00 a0 ff ff ff ff [24] 0x0c0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563663] 60 e8 97 a0 ff ff ff ff [25] 0x0c8 ffffffffa097e860 GetPowerStatus+0x0/0x30 [bromolow_synobios] [ 65.563666] 00 00 00 00 00 00 00 00 [26] 0x0d0 (null) (null) [ 65.563669] d0 e7 97 a0 ff ff ff ff [27] 0x0d8 ffffffffa097e7d0 InitModuleType+0x0/0x30 [bromolow_synobios] [ 65.563673] d0 ed 97 a0 ff ff ff ff [28] 0x0e0 ffffffffa097edd0 Uninitialize+0x0/0x20 [bromolow_synobios] [ 65.563678] f0 b0 00 a0 ff ff ff ff [29] 0x0e8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563682] f0 b0 00 a0 ff ff ff ff [30] 0x0f0 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563687] f0 b0 00 a0 ff ff ff ff [31] 0x0f8 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563689] 00 00 00 00 00 00 00 00 [32] 0x100 (null) (null) [ 65.563693] f0 b0 00 a0 ff ff ff ff [33] 0x108 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563698] f0 b0 00 a0 ff ff ff ff [34] 0x110 ffffffffa000b0f0 shim_null_zero_ulong+0x0/0x10 [redpill] [ 65.563700] 00 00 00 00 00 00 00 00 [35] 0x118 (null) (null) [ 65.563702] 00 00 00 00 00 00 00 00 [36] 0x120 (null) (null) [ 65.563704] 00 00 00 00 00 00 00 00 [37] 0x128 (null) (null) [ 65.563706] 00 00 00 00 00 00 00 00 [38] 0x130 (null) (null) [ 65.563708] 00 00 00 00 00 00 00 00 [39] 0x138 (null) (null) [ 65.563712] 30 e9 97 a0 ff ff ff ff [40] 0x140 ffffffffa097e930 GetCPUInfo+0x0/0x70 [bromolow_synobios] [ 65.563714] 00 00 00 00 00 00 00 00 [41] 0x148 (null) (null) [ 65.563716] 00 00 00 00 00 00 00 00 [42] 0x150 (null) (null) [ 65.563720] 70 ed 97 a0 ff ff ff ff [43] 0x158 ffffffffa097ed70 HWMONGetFanSpeedRPMFromADT+0x0/0x60 [bromolow_synobios] [ 65.563722] 00 00 00 00 00 00 00 00 [44] 0x160 (null) (null) [ 65.563726] 10 ed 97 a0 ff ff ff ff [45] 0x168 ffffffffa097ed10 HWMONGetVoltageSensorFromADT+0x0/0x60 [bromolow_synobios] [ 65.563728] 00 00 00 00 00 00 00 00 [46] 0x170 (null) (null) [ 65.563732] b0 ec 97 a0 ff ff ff ff [47] 0x178 ffffffffa097ecb0 HWMONGetThermalSensorFromADT+0x0/0x60 [bromolow_synobios] [ 65.563734] 00 00 00 00 00 00 00 00 [48] 0x180 (null) (null) [ 65.563736] 00 00 00 00 00 00 00 00 [49] 0x188 (null) (null) [ 65.563738] 00 00 00 00 00 00 00 00 [50] 0x190 (null) (null) [ 65.563740] 00 00 00 00 00 00 00 00 [51] 0x198 (null) (null) [ 65.563740] [ 65.563741] <redpill/bios_shims_collection.c:64> Finished printing memory at ffffffffa0986e40 [ 65.563741] <redpill/bios_shim.c:124> bromolow_synobios BIOS *fully* shimmed Edited September 17, 2021 by nemesis122 Link to comment Share on other sites More sharing options...
Orphée Posted September 17, 2021 Share #1671 Posted September 17, 2021 (edited) Yes actually, system is still unstable... Lot of logs like this : Quote [ 5507.900018] INFO: task kworker/u4:1:22566 blocked for more than 120 seconds. [ 5507.901167] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 5507.902445] kworker/u4:1 D ffff88013dd12f40 0 22566 2 0x00000000 [ 5507.903661] Workqueue: btrfs-worker btrfs_worker_helper [btrfs] [ 5507.904657] ffff880102e5fa90 0000000000000046 000000000000c000 ffff880102e5ffd8 [ 5507.905955] ffff880102e5ffd8 ffff8801353de040 0000000000000000 ffff880102e5fa90 [ 5507.907251] ffffffff8105f934 ffff880137deb800 ffff880137deb800 0000000000000040 [ 5507.908564] Call Trace: [ 5507.908978] [<ffffffff8105f934>] ? __wake_up+0x34/0x50 [ 5507.909836] [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190 [ 5507.910789] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5507.911802] [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30 [ 5507.912714] [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno] [ 5507.914109] [<ffffffff811211dd>] ? cache_alloc_refill+0x6d/0x6a0 [ 5507.915103] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5507.916141] [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0 [ 5507.917110] [<ffffffff813c66b1>] ? md_make_request+0x251/0x460 [ 5507.918082] [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0 [ 5507.919103] [<ffffffff81270827>] ? submit_bio+0x77/0x190 [ 5507.919991] [<ffffffffa0214150>] ? flashcache_merge+0x40/0xd0 [flashcache_syno] [ 5507.921196] [<ffffffffa01da44d>] ? dm_merge_bvec+0xcd/0xf0 [dm_mod] [ 5507.922224] [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs] [ 5507.923301] [<ffffffffa0a6360e>] ? btrfs_csum_one_bio+0x2ee/0x330 [btrfs] [ 5507.924450] [<ffffffffa0a74df7>] ? __btrfs_submit_bio_done+0x17/0x40 [btrfs] [ 5507.925618] [<ffffffffa0a65d43>] ? run_one_async_start+0x23/0x30 [btrfs] [ 5507.926726] [<ffffffffa0aa93e7>] ? btrfs_worker_helper+0x127/0x3b0 [btrfs] [ 5507.927866] [<ffffffff8105f553>] ? worker_run_work+0xa3/0xf0 [ 5507.928814] [<ffffffffa0aa92c0>] ? __btrfs_alloc_workqueue+0x160/0x160 [btrfs] [ 5507.930003] [<ffffffff81050fce>] ? process_one_work+0x14e/0x520 [ 5507.930985] [<ffffffff81051d98>] ? worker_thread+0x108/0x420 [ 5507.931928] [<ffffffff81051c90>] ? manage_workers.isra.30+0x260/0x260 [ 5507.932991] [<ffffffff81057b01>] ? kthread+0xb1/0xc0 [ 5507.933829] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 [ 5507.934823] [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0 [ 5507.935736] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 [ 5627.839951] INFO: task btrfs-transacti:8021 blocked for more than 120 seconds. [ 5627.841248] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 5627.842544] btrfs-transacti D ffff88013dd12f40 0 8021 2 0x00000000 [ 5627.843763] ffff88012800b790 0000000000000046 000000000000c000 ffff88012800bfd8 [ 5627.845084] ffff88012800bfd8 ffff880134d34040 ffff8801248a8e00 ffff88013338f400 [ 5627.846401] 0000000002d40000 ffff880137deb800 ffff880137deb800 0000000000000080 [ 5627.847712] Call Trace: [ 5627.848139] [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190 [ 5627.849096] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5627.850149] [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30 [ 5627.851116] [<ffffffffa0acb3a5>] ? add_delayed_tree_ref.isra.15+0xd5/0x1a0 [btrfs] [ 5627.852425] [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno] [ 5627.853846] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5627.854900] [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0 [ 5627.855888] [<ffffffff813c66b1>] ? md_make_request+0x251/0x460 [ 5627.856871] [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0 [ 5627.857908] [<ffffffff81270827>] ? submit_bio+0x77/0x190 [ 5627.858800] [<ffffffff8116963c>] ? __bio_clone+0x1c/0xa0 [ 5627.859695] [<ffffffff8116ad08>] ? bio_clone_bioset+0x28/0x60 [ 5627.860671] [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs] [ 5627.861788] [<ffffffffa0a69408>] ? btree_submit_bio_hook+0xa8/0x160 [btrfs] [ 5627.862974] [<ffffffffa0a9328f>] ? submit_one_bio+0x5f/0x90 [btrfs] [ 5627.864036] [<ffffffffa0a96dc3>] ? submit_extent_page.constprop.52+0x123/0x290 [btrfs] [ 5627.865363] [<ffffffffa0a933b0>] ? end_extent_buffer_writeback+0x20/0x20 [btrfs] [ 5627.866602] [<ffffffffa0a9cca3>] ? write_one_eb+0x1a3/0x2e0 [btrfs] [ 5627.867663] [<ffffffffa0a933b0>] ? end_extent_buffer_writeback+0x20/0x20 [btrfs] [ 5627.868908] [<ffffffffa0a9cfab>] ? btree_write_cache_pages+0x1cb/0x3a0 [btrfs] [ 5627.870117] [<ffffffff810ee052>] ? do_writepages+0x22/0x80 [ 5627.871042] [<ffffffff810e3a49>] ? __filemap_fdatawrite_range+0x49/0x50 [ 5627.872154] [<ffffffffa0a71973>] ? btrfs_write_marked_extents+0xc3/0x190 [btrfs] [ 5627.873398] [<ffffffffa0a7253c>] ? btrfs_commit_transaction+0x7cc/0xcc0 [btrfs] [ 5627.874624] [<ffffffffa0a6ce3d>] ? transaction_kthread+0x26d/0x2e0 [btrfs] [ 5627.875778] [<ffffffffa0a6cbd0>] ? btrfs_cleanup_transaction+0x5c0/0x5c0 [btrfs] [ 5627.877010] [<ffffffff81057b01>] ? kthread+0xb1/0xc0 [ 5627.877845] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 [ 5627.878857] [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0 [ 5627.879785] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 [ 5627.880844] INFO: task kworker/u4:1:22566 blocked for more than 120 seconds. [ 5627.882030] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 5627.883353] kworker/u4:1 D ffff88013dd12f40 0 22566 2 0x00000000 [ 5627.884584] Workqueue: btrfs-worker btrfs_worker_helper [btrfs] [ 5627.885591] ffff880102e5fa90 0000000000000046 000000000000c000 ffff880102e5ffd8 [ 5627.886927] ffff880102e5ffd8 ffff8801353de040 0000000000000000 ffff880102e5fa90 [ 5627.888243] ffffffff8105f934 ffff880137deb800 ffff880137deb800 0000000000000040 [ 5627.889563] Call Trace: [ 5627.889987] [<ffffffff8105f934>] ? __wake_up+0x34/0x50 [ 5627.890848] [<ffffffff813c28c5>] ? md_write_start+0xa5/0x190 [ 5627.891797] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5627.892843] [<ffffffff813ba1fe>] ? make_request+0x7e/0xe30 [ 5627.893774] [<ffffffffa02248c8>] ? flashcache_map_core.isra.18+0x2b8/0x1640 [flashcache_syno] [ 5627.895178] [<ffffffff811211dd>] ? cache_alloc_refill+0x6d/0x6a0 [ 5627.896179] [<ffffffff81058430>] ? wake_atomic_t_function+0x60/0x60 [ 5627.897224] [<ffffffff813bdade>] ? md_handle_request+0x8e/0xe0 [ 5627.898200] [<ffffffff813c66b1>] ? md_make_request+0x251/0x460 [ 5627.899174] [<ffffffff812705d9>] ? generic_make_request+0xd9/0x2b0 [ 5627.900204] [<ffffffff81270827>] ? submit_bio+0x77/0x190 [ 5627.901104] [<ffffffffa0214150>] ? flashcache_merge+0x40/0xd0 [flashcache_syno] [ 5627.902320] [<ffffffffa01da44d>] ? dm_merge_bvec+0xcd/0xf0 [dm_mod] [ 5627.903372] [<ffffffffa0aa4e68>] ? __btrfs_map_bio+0x1e8/0x540 [btrfs] [ 5627.904478] [<ffffffffa0a6360e>] ? btrfs_csum_one_bio+0x2ee/0x330 [btrfs] [ 5627.905621] [<ffffffffa0a74df7>] ? __btrfs_submit_bio_done+0x17/0x40 [btrfs] [ 5627.906805] [<ffffffffa0a65d43>] ? run_one_async_start+0x23/0x30 [btrfs] [ 5627.908116] [<ffffffffa0aa93e7>] ? btrfs_worker_helper+0x127/0x3b0 [btrfs] [ 5627.909261] [<ffffffff8105f553>] ? worker_run_work+0xa3/0xf0 [ 5627.910219] [<ffffffffa0aa92c0>] ? __btrfs_alloc_workqueue+0x160/0x160 [btrfs] [ 5627.911426] [<ffffffff81050fce>] ? process_one_work+0x14e/0x520 [ 5627.912420] [<ffffffff81051d98>] ? worker_thread+0x108/0x420 [ 5627.913369] [<ffffffff81051c90>] ? manage_workers.isra.30+0x260/0x260 [ 5627.914442] [<ffffffff81057b01>] ? kthread+0xb1/0xc0 [ 5627.915284] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 [ 5627.916288] [<ffffffff814afe0d>] ? ret_from_fork+0x5d/0xb0 [ 5627.917208] [<ffffffff81057a50>] ? kthread_worker_fn+0x160/0x160 Edit : CPU also does not like at all the docker container running : I had to reset VM. Edited September 17, 2021 by Orphée Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1672 Posted September 17, 2021 1 hour ago, altas said: is this Docker Crashes also in 6.2.4 or only in DSM7 ? maybe docker has some problems with DSM7 i will do a test today evening with my fresh Gen8 on DSM7 and a Docker Image. I saw these crashes when I first migrated to 6.2.4 and they persisted into 7.0.1-RC1 Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1673 Posted September 17, 2021 34 minutes ago, nemesis122 said: [ 817.217981] CPU: 0 PID: 2187 Comm: runc Tainted: PF C O 3.10.108 #42214 [ 817.253128] Hardware name: HP ProLiant MicroServer Gen8, BIOS J06 11/02/2015 [ 817.287675] ffffffff814a2759 ffffffff814a16b1 0000000000000010 ffff880309a08d60 [ 817.324215] ffff880309a08cf8 0000000000000000 0000000000000000 0000000000000001 [ 817.360571] 0000000000000000 ffffffff80000001 0000000000000030 ffff8802f9a1d000 [ 817.396778] Call Trace: [ 817.408374] <NMI> [<ffffffff814a2759>] ? dump_stack+0xc/0x15 [ 817.436964] [<ffffffff814a16b1>] ? panic+0xbb/0x1df [ 817.461381] [<ffffffff810a9eb8>] ? watchdog_overflow_callback+0xa8/0xb0 [ 817.494556] [<ffffffff810db7d3>] ? __perf_event_overflow+0x93/0x230 [ 817.525829] [<ffffffff810da612>] ? perf_event_update_userpage+0x12/0xf0 [ 817.558713] [<ffffffff810152a4>] ? intel_pmu_handle_irq+0x1b4/0x340 [ 817.590098] [<ffffffff814a9d06>] ? perf_event_nmi_handler+0x26/0x40 [ 817.621060] [<ffffffff814a944e>] ? do_nmi+0xfe/0x440 [ 817.645848] [<ffffffff814a8a53>] ? end_repeat_nmi+0x1e/0x7e [ 817.673689] <<EOE>> [ 817.684069] Rebooting in 3 seconds.. Thanks for grabbing the console output 👍 This actually looks a little different to what I've been seeing (crashes with containerd-shim). Yours is indicating a problem with 'runs', and seems related to a thread I found while searching this morning: https://github.com/opencontainers/runc/issues/2530 This still manifests as problems with containerd. Some people are seeing reboots, some are simply seeing lockups. You really ought to update the BIOS on your Gen8, by the way , and check out whatever other firmware updates are available for it... 1 Link to comment Share on other sites More sharing options...
WiteWulf Posted September 17, 2021 Share #1674 Posted September 17, 2021 @Orphée I'm not seeing excessive CPU usage on my baremetal install, fwiw. Can you have a look at the output of 'top' and see what process(es) are responsible? Link to comment Share on other sites More sharing options...
Orphée Posted September 17, 2021 Share #1675 Posted September 17, 2021 System was not responding while high CPU usage. I'm not able to reproduce the issue ... docker started again... The only difference between last crash and now is I did not reboot after disabling IPv6 earlier... Whereas now the system fresh started with IPv6 disabled... Don't know if it is related... but currently seems to be stable. 1 Link to comment Share on other sites More sharing options...
Recommended Posts