Occasional Boot Problems

Hello,

About 25% of attempts to boot my ThinkPad T495s fail: the boot just stops after the GRUB output and I am left staring at a black screen.
I have uploaded two files to a GitHub gist, one with the output of journalctl -b -0 (successful boot) and one with journalctl -b -1 (failed boot).

https://gist.github.com/PhilippHochmann/0f55b92a2d242287f1ae7aed2147d4f0#file-failedboot-txt

https://gist.github.com/PhilippHochmann/0f55b92a2d242287f1ae7aed2147d4f0#file-successfulboot-txt
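
For anyone who wants to collect the same kind of logs, here is a minimal sketch. It assumes the current boot is the successful one and the previous boot is the failed one, as in this case. The helper name `save_boot_logs` and the output file names are only illustrative.

```shell
# Hypothetical helper: save the journal of the current and the previous boot
# into the directory given as $1 (defaults to the current directory).
save_boot_logs() {
    outdir=${1:-.}
    # -b -0 selects the current boot, -b -1 the boot before it.
    journalctl -b -0 --no-pager > "$outdir/successfulboot.txt"
    journalctl -b -1 --no-pager > "$outdir/failedboot.txt"
}

# Usage (not run here):
#   journalctl --list-boots    # check which boot index is which
#   save_boot_logs ~/bootlogs
```

If the failed boot is not the one directly before the current boot, the index passed to -b has to be adjusted accordingly.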

I hope it's not a hardware problem. In the past, I had not inserted the SSD (Samsung 970 EVO Plus) far enough into the M.2 slot, which resulted in a loose contact and failed boots. I have since seated it properly and re-installed Manjaro.

I hope someone can help! If you need more info, just ask :)
Thanks in advance

Hello and welcome to Manjaro!

So far I can only tell you that the boot stops dead during hardware init, and the log gives no hint as to which part is hanging.

To get more insight, we need more information on the hardware and software you are using at the moment, as well as the kernel parameters. Could you please post the output of

inxi --full --verbosity=7 --filter --no-host

and format it as described in this post. If you already know how to edit the kernel parameters you may want to omit the 'quiet' option.
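
In case it helps, removing 'quiet' could look roughly like the sketch below. It assumes GNU sed and the usual /etc/default/grub layout. The helper name `strip_quiet` is made up for illustration, and it only prints the edited file so the result can be reviewed first.

```shell
# Hypothetical helper: print the GRUB defaults file given as $1 with the
# 'quiet' option removed from GRUB_CMDLINE_LINUX_DEFAULT. The file itself
# is not modified. Assumes GNU sed (for \b word boundaries).
strip_quiet() {
    sed -E '/^GRUB_CMDLINE_LINUX_DEFAULT=/ s/\bquiet\b ?//' "$1"
}

# Usage (not run here):
#   strip_quiet /etc/default/grub                  # preview the result
#   sudo grub-mkconfig -o /boot/grub/grub.cfg      # regenerate after editing
```

After the file has actually been edited, the GRUB configuration has to be regenerated for the change to take effect on the next boot.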

Hello,

here is the output of the command you provided:

System:    Kernel: 5.4.40-1-MANJARO x86_64 bits: 64 compiler: gcc v: 9.3.0 Desktop: Gnome 3.36.2 info: plank 
           wm: gnome-shell dm: GDM 3.34.1 Distro: Manjaro Linux 
Machine:   Type: Laptop System: LENOVO product: 20QKS01E00 v: ThinkPad T495s serial: <filter> Chassis: type: 10 
           serial: <filter> 
           Mobo: LENOVO model: 20QKS01E00 serial: <filter> UEFI: LENOVO v: R13ET41W(1.15 ) date: 01/06/2020 
Battery:   ID-1: BAT0 charge: 14.0 Wh condition: 59.2/57.0 Wh (104%) volts: 11.2/11.6 model: LGC 02DL013 type: Li-poly 
           serial: <filter> status: Discharging cycles: 45 
Memory:    RAM: total: 13.64 GiB used: 1.17 GiB (8.5%) 
           Array-1: capacity: 64 GiB slots: 2 EC: None max module size: 32 GiB note: est. 
           Device-1: DIMM 0 size: 8 GiB speed: 2400 MT/s type: DDR4 detail: synchronous unbuffered (unregistered) 
           bus width: 64 bits total: 64 bits manufacturer: Samsung part-no: M471A1G44AB0-CTD serial: N/A 
           Device-2: DIMM 0 size: 8 GiB speed: 2400 MT/s type: DDR4 detail: synchronous unbuffered (unregistered) 
           bus width: 64 bits total: 64 bits manufacturer: Samsung part-no: M471A1G44AB0-CTD serial: N/A 
CPU:       Topology: Quad Core model: AMD Ryzen 7 PRO 3700U w/ Radeon Vega Mobile Gfx bits: 64 type: MT MCP arch: Zen+ 
           rev: 1 L1 cache: 384 KiB L2 cache: 2048 KiB L3 cache: 4096 KiB bogomips: 36738 
           Speed: 1221 MHz min/max: 1400/2300 MHz boost: enabled Core speeds (MHz): 1: 1208 2: 1218 3: 1208 4: 1221 5: 1207 
           6: 1206 7: 1301 8: 2133 
           Flags: 3dnowprefetch abm adx aes aperfmperf apic arat avic avx avx2 bmi1 bmi2 bpext clflush clflushopt clzero 
           cmov cmp_legacy constant_tsc cpb cpuid cr8_legacy cx16 cx8 de decodeassists extapic extd_apicid f16c flushbyasid 
           fma fpu fsgsbase fxsr fxsr_opt ht hw_pstate ibpb irperf lahf_lm lbrv lm mca mce misalignsse mmx mmxext monitor 
           movbe msr mtrr mwaitx nonstop_tsc nopl npt nrip_save nx osvw overflow_recov pae pat pausefilter pclmulqdq 
           pdpe1gb perfctr_core perfctr_llc perfctr_nb pfthreshold pge pni popcnt pse pse36 rdrand rdseed rdtscp rep_good 
           sep sev sha_ni skinit smap smca sme smep ssbd sse sse2 sse4_1 sse4_2 sse4a ssse3 succor svm svm_lock syscall tce 
           topoext tsc tsc_scale v_vmsave_vmload vgif vmcb_clean vme vmmcall wdt xgetbv1 xsave xsavec xsaveerptr xsaveopt 
           xsaves 
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Picasso vendor: Lenovo driver: amdgpu v: kernel bus ID: 05:00.0 
           chip ID: 1002:15d8 
           Display: server: X.org 1.20.8 driver: amdgpu compositor: gnome-shell resolution: <xdpyinfo missing> 
           OpenGL: renderer: AMD RAVEN (DRM 3.35.0 5.4.40-1-MANJARO LLVM 10.0.0) v: 4.6 Mesa 20.0.6 direct render: Yes 
Audio:     Device-1: Advanced Micro Devices [AMD/ATI] Raven/Raven2/Fenghuang HDMI/DP Audio vendor: Lenovo 
           driver: snd_hda_intel v: kernel bus ID: 05:00.1 chip ID: 1002:15de 
           Device-2: Advanced Micro Devices [AMD] Raven/Raven2/FireFlight/Renoir Audio Processor vendor: Lenovo driver: N/A 
           bus ID: 05:00.5 chip ID: 1022:15e2 
           Device-3: Advanced Micro Devices [AMD] Family 17h HD Audio vendor: Lenovo driver: snd_hda_intel v: kernel 
           bus ID: 05:00.6 chip ID: 1022:15e3 
           Sound Server: ALSA v: k5.4.40-1-MANJARO 
Network:   Device-1: Intel Wireless-AC 9260 driver: iwlwifi v: kernel bus ID: 01:00.0 chip ID: 8086:2526 
           IF: wlp1s0 state: up mac: <filter> 
           IP v4: <filter> type: dynamic noprefixroute scope: global broadcast: <filter> 
           IP v6: <filter> type: dynamic noprefixroute scope: global 
           IP v6: <filter> type: noprefixroute scope: link 
           Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Lenovo driver: r8169 v: kernel 
           port: 2400 bus ID: 03:00.0 chip ID: 10ec:8168 
           IF: enp3s0f0 state: down mac: <filter> 
           WAN IP: <filter> 
Drives:    Local Storage: total: 465.76 GiB used: 24.73 GiB (5.3%) 
           ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 970 EVO Plus 500GB size: 465.76 GiB speed: 31.6 Gb/s lanes: 4 
           serial: <filter> rev: 2B2QEXM7 scheme: GPT 
           Message: No Optical or Floppy data was found. 
RAID:      Message: No RAID data was found. 
Partition: ID-1: / size: 425.66 GiB used: 24.73 GiB (5.8%) fs: ext4 dev: /dev/nvme0n1p2 label: N/A 
           uuid: 8a0e947a-6780-4fb7-93ae-0bfbefce5c0e 
           ID-2: /boot/efi size: 299.4 MiB used: 280 KiB (0.1%) fs: vfat dev: /dev/nvme0n1p1 label: N/A uuid: BA6C-3887 
           ID-3: swap-1 size: 32.00 GiB used: 0 KiB (0.0%) fs: swap dev: /dev/nvme0n1p3 label: N/A 
           uuid: 40b6701c-3247-4fd2-a986-55f2ccc449f2 
Unmounted: Message: No unmounted partitions found. 
USB:       Hub: 1-0:1 info: Full speed (or root) Hub ports: 1 rev: 2.0 speed: 480 Mb/s chip ID: 1d6b:0002 
           Hub: 2-0:1 info: Full speed (or root) Hub ports: 4 rev: 2.0 speed: 480 Mb/s chip ID: 1d6b:0002 
           Hub: 3-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.1 speed: 10 Gb/s chip ID: 1d6b:0003 
           Hub: 4-0:1 info: Full speed (or root) Hub ports: 2 rev: 2.0 speed: 480 Mb/s chip ID: 1d6b:0002 
           Device-1: 4-1:2 info: Intel type: Bluetooth driver: btusb interfaces: 2 rev: 2.0 speed: 12 Mb/s 
           chip ID: 8087:0025 
           Hub: 4-2:3 info: Genesys Logic 4-port hub ports: 4 rev: 2.0 speed: 480 Mb/s chip ID: 05e3:0610 
           Device-2: 4-2.1:4 info: IMC Networks type: Video driver: uvcvideo interfaces: 4 rev: 2.0 speed: 480 Mb/s 
           chip ID: 13d3:56ba serial: <filter> 
           Device-3: 4-2.2:5 info: Alcor Micro AU9540 Smartcard Reader type: Smart Card driver: N/A interfaces: 1 rev: 2.0 
           speed: 12 Mb/s chip ID: 058f:9540 
           Device-4: 4-2.4:6 info: Synaptics type: <vendor specific> driver: N/A interfaces: 1 rev: 2.0 speed: 12 Mb/s 
           chip ID: 06cb:00bd serial: <filter> 
           Hub: 5-0:1 info: Full speed (or root) Hub ports: 1 rev: 3.1 speed: 10 Gb/s chip ID: 1d6b:0003 
Sensors:   System Temperatures: cpu: 35.5 C mobo: 0.0 C gpu: amdgpu temp: 35 C 
           Fan Speeds (RPM): cpu: 0 
Info:      Processes: 259 Uptime: 3m Init: systemd v: 245 Compilers: gcc: 9.3.0 Shell: bash (sudo) v: 5.0.16 
           running in: gnome-terminal inxi: 3.0.37 

I have already removed "quiet" from the GRUB config (and regenerated it).
So I can easily tell when a boot has failed and I need to restart: on a successful boot, a lot of text is printed, then the screen is cleared, and text is printed a second time. When the boot fails, no text appears after the screen is cleared.

Here is my GRUB config file (you can see the kernel parameters there, right?):

GRUB_DEFAULT=saved
GRUB_TIMEOUT=10
GRUB_TIMEOUT_STYLE=hidden
GRUB_DISTRIBUTOR="Manjaro"
GRUB_CMDLINE_LINUX_DEFAULT="udev.log_priority=3 resume=UUID=40b6701c-3247-4fd2-a986-55f2ccc449f2"
GRUB_CMDLINE_LINUX=""

# If you want to enable the save default function, uncomment the following
# line, and set GRUB_DEFAULT to saved.
GRUB_SAVEDEFAULT=true

# Preload both GPT and MBR modules so that they are not missed
GRUB_PRELOAD_MODULES="part_gpt part_msdos"

# Uncomment to enable booting from LUKS encrypted devices
#GRUB_ENABLE_CRYPTODISK=y

# Uncomment to use basic console
GRUB_TERMINAL_INPUT=console

# Uncomment to disable graphical terminal
#GRUB_TERMINAL_OUTPUT=console

# The resolution used on graphical terminal
# note that you can use only modes which your graphic card supports via VBE
# you can see them in real GRUB with the command 'videoinfo'
GRUB_GFXMODE=auto

# Uncomment to allow the kernel use the same resolution used by grub
GRUB_GFXPAYLOAD_LINUX=keep

# Uncomment if you want GRUB to pass to the Linux kernel the old parameter
# format "root=/dev/xxx" instead of "root=/dev/disk/by-uuid/xxx"
#GRUB_DISABLE_LINUX_UUID=true

# Uncomment to disable generation of recovery mode menu entries
GRUB_DISABLE_RECOVERY=true

# Uncomment and set to the desired menu colors.  Used by normal and wallpaper
# modes only.  Entries specified as foreground/background.
GRUB_COLOR_NORMAL="light-gray/black"
GRUB_COLOR_HIGHLIGHT="green/black"

# Uncomment one of them for the gfx desired, a image background or a gfxtheme
#GRUB_BACKGROUND="/usr/share/grub/background.png"
#GRUB_THEME="/path/to/gfxtheme"

# Uncomment to get a beep at GRUB start
#GRUB_INIT_TUNE="480 440 1"
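
As a side note, the default kernel parameters can be pulled out of a config like the one above with a small sketch such as this. The helper name `grub_default_params` is hypothetical, and it assumes the value is enclosed in double quotes on a single line.

```shell
# Hypothetical helper: print the value of GRUB_CMDLINE_LINUX_DEFAULT from
# the GRUB defaults file given as $1 (normally /etc/default/grub).
grub_default_params() {
    sed -n 's/^GRUB_CMDLINE_LINUX_DEFAULT="\(.*\)"$/\1/p' "$1"
}

# Usage (not run here):
#   grub_default_params /etc/default/grub
#   cat /proc/cmdline    # parameters the running kernel actually received
```

The /proc/cmdline output also contains entries that GRUB adds itself (such as the root device), so it is usually longer than the value in the config file.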

Thanks for your help :)

You describe the moment when the resolution changes and the green systemd [ OK ] messages get printed again after the GPU init. The GPU was the first thing I suspected as the culprit.

As this is still quite new hardware, have you tried a newer kernel, such as linux56 (5.6.12-1 at the moment)? You can change it in Manjaro via Manjaro settings -> Kernel and then choose the right one.
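
For the command line, a rough sketch of checking the running kernel series is shown below. The helper name `kernel_at_least` is made up, and it only looks at the major and minor version numbers. The mhwd-kernel calls in the comments are the terminal counterpart of the settings dialog.

```shell
# Hypothetical helper: succeed if the kernel release in $1
# (e.g. "5.4.40-1-MANJARO") is at least major version $2, minor version $3.
kernel_at_least() {
    kal_major=${1%%.*}           # text before the first dot
    kal_rest=${1#*.}             # text after the first dot
    kal_minor=${kal_rest%%.*}    # text up to the next dot
    [ "$kal_major" -gt "$2" ] ||
        { [ "$kal_major" -eq "$2" ] && [ "$kal_minor" -ge "$3" ]; }
}

# Usage (not run here):
#   kernel_at_least "$(uname -r)" 5 6 || echo "still on an older kernel"
#   mhwd-kernel -li                # list installed kernels
#   sudo mhwd-kernel -i linux56    # install the 5.6 series
```

The newly installed kernel is only used after a reboot, and the older one stays installed as a fallback.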

For now, the machine works fine with the newer kernel; all boots so far have been successful. I hope it stays this way :) Thanks for your help!
