页次: 1
我發現這個問題國外的論壇已經很多人回報Bug了
有些有解決有些沒有
我英文能力沒有說很好,有人知道解法嗎?
我電腦目前顯卡的驅動方式是Intel搭Nvidia Prime Offload一起用
凍結時間幾秒鐘到幾分鐘都有,甚至可以當住5分鐘以上
硬體:
Asus X560UD
Nvidia GTX1050
Intel i5-8250U(KabyLake, 集成顯卡Intel UHD 620)
/etc/X11/xorg.conf.d/nvidia.conf 長這樣(沒有Xorg.conf)
Section "ServerLayout"
Identifier "layout"
Screen 0 "iGPU"
Option "AllowNVIDIAGPUScreens"
EndSection
Section "Device"
Identifier "iGPU"
Option "NoAccel" "True"
Driver "modesetting"
BusID "PCI:0:2:0"
EndSection
Section "Screen"
Identifier "iGPU"
Device "iGPU"
EndSection
Section "Device"
Identifier "dGPU"
Driver "nvidia"
BusID "PCI:1:0:0"
EndSection
其中 Option "NoAccel" "True" 是我後來加上去的,但是沒有用
用 journalctl 抓出來的錯誤訊息
12月 31 14:07:29 YsPan-X560UD-Linux kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5
12月 31 14:07:29 YsPan-X560UD-Linux kernel: pcieport 0000:00:1c.5: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
12月 31 14:07:29 YsPan-X560UD-Linux kernel: pcieport 0000:00:1c.5: AER: device [8086:9d15] error status/mask=00001000/00002000
12月 31 14:07:29 YsPan-X560UD-Linux kernel: pcieport 0000:00:1c.5: AER: [12] Timeout
12月 31 14:07:36 YsPan-X560UD-Linux kernel: i915 0000:00:02.0: GPU HANG: ecode 9:1:0x00000000, hang on rcs0
12月 31 14:07:36 YsPan-X560UD-Linux kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
12月 31 14:07:36 YsPan-X560UD-Linux kernel: Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
12月 31 14:07:36 YsPan-X560UD-Linux kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.
12月 31 14:07:36 YsPan-X560UD-Linux kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.
12月 31 14:07:36 YsPan-X560UD-Linux kernel: GPU crash dump saved to /sys/class/drm/card1/error
12月 31 14:07:36 YsPan-X560UD-Linux kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
12月 31 14:07:36 YsPan-X560UD-Linux /usr/lib/gdm-x-session[815]: (II) event10 - USB OPTICAL MOUSE: SYN_DROPPED event - some input events have been lost.
GPU的Coredump
GPU HANG: ecode 9:1:0x00000000, hang on rcs0
Kernel: 5.4.6-arch3-1 x86_64
Driver: 20190822
Time: 1577772456 s 96094 us
Boottime: 3775 s 564991 us
Uptime: 3774 s 236369 us
Epoch: 4296009920 jiffies (300 HZ)
Capture: 4296009920 jiffies; 182447 ms ago, 0 ms after epoch
Reset count: 0
Suspend count: 0
Platform: KABYLAKE
Subplatform: 0x0
PCI ID: 0x5917
PCI Revision: 0x07
PCI Subsystem: 1043:11e1
IOMMU enabled?: 0
DMC loaded: yes
DMC fw version: 1.4
GT awake: yes
RPM wakelock: yes
PM suspended: no
EIR: 0x00000000
IER: 0x08080000
GTIER[0]: 0x01010101
GTIER[1]: 0x01010101
GTIER[2]: 0x00000070
GTIER[3]: 0x00000101
PGTBL_ER: 0x00000000
FORCEWAKE: 0x00010001
DERRMR: 0x2077efef
CCID: 0x00000000
fence[0] = 100603b00800001
fence[1] = 188603b01080001
fence[2] = 20c603b018c0001
fence[3] = 290603b02100001
fence[4] = 314603b02940001
fence[5] = 39c603b031c0001
fence[6] = 420603b03a00001
fence[7] = 4a4603b04240001
fence[8] = 00000000
fence[9] = 00000000
fence[10] = 00000000
fence[11] = 00000000
fence[12] = 00000000
fence[13] = 00000000
fence[14] = 00000000
fence[15] = 00000000
fence[16] = 00000000
fence[17] = 00000000
fence[18] = 00000000
fence[19] = 00000000
fence[20] = 00000000
fence[21] = 00000000
fence[22] = 00000000
fence[23] = 00000000
fence[24] = 00000000
fence[25] = 00000000
fence[26] = 00000000
fence[27] = 00000000
fence[28] = 00000000
fence[29] = 00000000
fence[30] = 00000000
fence[31] = 00000000
ERROR: 0x00000000
DONE_REG: 0xffffffff
FAULT_TLB_DATA: 0x0000001f 0xdbe68214
Num Pipes: 3
Pipe [0]:
Power: on
SRC: 077f0437
STAT: 00000000
Plane [0]:
CNTR: c4042400
STRIDE: 0000000f
SURF: 031c0000
TILEOFF: 00000000
Cursor [0]:
CNTR: 04000023
POS: 02c3052d
BASE: 03180000
Pipe [1]:
Power: off
SRC: 00000000
STAT: 00000000
Plane [1]:
CNTR: 00000000
STRIDE: 00000000
SURF: 00000000
TILEOFF: 00000000
Cursor [1]:
CNTR: 00000000
POS: 00000000
BASE: 00000000
Pipe [2]:
Power: off
SRC: 00000000
STAT: 00000000
Plane [2]:
CNTR: 00000000
STRIDE: 00000000
SURF: 00000000
TILEOFF: 00000000
Cursor [2]:
CNTR: 00000000
POS: 00000000
BASE: 00000000
CPU transcoder: A
Power: off
CONF: 00000000
HTOTAL: 00000000
HBLANK: 00000000
HSYNC: 00000000
VTOTAL: 00000000
VBLANK: 00000000
VSYNC: 00000000
CPU transcoder: A
Power: off
CONF: 00000000
HTOTAL: 00000000
HBLANK: 00000000
HSYNC: 00000000
VTOTAL: 00000000
VBLANK: 00000000
VSYNC: 00000000
CPU transcoder: A
Power: off
CONF: 00000000
HTOTAL: 00000000
HBLANK: 00000000
HSYNC: 00000000
VTOTAL: 00000000
VBLANK: 00000000
VSYNC: 00000000
CPU transcoder: EDP
Power: on
CONF: c0000000
HTOTAL: 08c9077f
HBLANK: 08c9077f
HSYNC: 080b07cf
VTOTAL: 046b0437
VBLANK: 046b0437
VSYNC: 0445043d
is_mobile: no
is_lp: no
require_force_probe: no
has_64bit_reloc: yes
gpu_reset_clobbers_display: no
has_reset_engine: yes
has_fpga_dbg: yes
has_global_mocs: no
has_gt_uc: yes
has_l3_dpf: no
has_llc: yes
has_logical_ring_contexts: yes
has_logical_ring_elsq: no
has_logical_ring_preemption: yes
has_pooled_eu: no
has_rc6: yes
has_rc6p: no
has_rps: yes
has_runtime_pm: yes
has_snoop: no
has_coherent_ggtt: yes
unfenced_needs_alignment: no
hws_needs_physical: no
cursor_needs_physical: no
has_csr: yes
has_ddi: yes
has_dp_mst: yes
has_fbc: yes
has_gmch: no
has_hotplug: yes
has_ipc: yes
has_modular_fia: no
has_overlay: no
has_psr: yes
overlay_needs_physical: no
supports_tv: no
Has logical contexts? yes
scheduler: 1f
slice0: 3 subslice(s) (0x7):
subslice0: 8 EUs (0xff)
subslice1: 8 EUs (0xff)
subslice2: 8 EUs (0xff)
subslice3: 0 EUs (0x0)
slice1: 0 subslice(s) (0x0):
subslice0: 0 EUs (0x0)
subslice1: 0 EUs (0x0)
subslice2: 0 EUs (0x0)
subslice3: 0 EUs (0x0)
slice2: 0 subslice(s) (0x0):
subslice0: 0 EUs (0x0)
subslice1: 0 EUs (0x0)
subslice2: 0 EUs (0x0)
subslice3: 0 EUs (0x0)
i915.vbt_firmware=(null)
i915.modeset=-1
i915.lvds_channel_mode=0
i915.panel_use_ssc=-1
i915.vbt_sdvo_panel_type=-1
i915.enable_dc=-1
i915.enable_fbc=1
i915.enable_psr=-1
i915.disable_power_well=1
i915.enable_ips=1
i915.invert_brightness=0
i915.enable_guc=0
i915.guc_log_level=-1
i915.guc_firmware_path=(null)
i915.huc_firmware_path=(null)
i915.dmc_firmware_path=(null)
i915.mmio_debug=0
i915.edp_vswing=0
i915.reset=2
i915.inject_load_failure=0
i915.fastboot=-1
i915.enable_dpcd_backlight=0
i915.force_probe=*
i915.alpha_support=yes
i915.enable_hangcheck=yes
i915.prefault_disable=no
i915.load_detect_test=no
i915.force_reset_modeset_test=no
i915.error_capture=yes
i915.disable_display=no
i915.verbose_state_checks=yes
i915.nuclear_pageflip=no
i915.enable_dp_mst=yes
i915.enable_gvt=no
GuC firmware:
status: DISABLED
version: wanted 33.0, found 0.0
uCode: 0 bytes
RSA: 0 bytes
HuC firmware: (null)
status: N/A
version: wanted 0.0, found 0.0
uCode: 0 bytes
RSA: 0 bytes
离线
是不是 5.4 内核的问题呢。你要不试试 4.19 看看能不能复现?我一直用的 4.19,没有遇到问题。
离线
我後來有關掉一些Flag
例如Rasterization相關的,讓他回到Software Render
好像狀況有改善了
穩定很多
离线
页次: 1