Hi all
Sorry, not sure what information is most appropriate here. GPU hangs from time to time on this laptop, typically when running firefox on graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6 (squeeze), lspci
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller]) Subsystem: Acer Incorporated [ALI] Device 011f Flags: bus master, fast devsel, latency 0, IRQ 46 Memory at fc000000 (64-bit, non-prefetchable) [size=1M] Memory at d0000000 (64-bit, prefetchable) [size=256M] I/O ports at 1800 [size=8] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 3 Kernel driver in use: i915
kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the X-server freezes and restarts with no errors in dmesg. Is it a known problem?
Thanks Guennadi --- Guennadi Liakhovetski, Ph.D. Freelance Open-Source Software Developer http://www.open-technology.de/
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state [drm:i915_reset] *ERROR* Failed to reset chip. ------------[ cut here ]------------ WARNING: at drivers/gpu/drm/i915/intel_display.c:906 intel_enable_pipe+0x16c/0x1c0 [i915]() Hardware name: Extensa 5220 PLL state assertion failure (expected on, current off) Modules linked in: nls_iso8859_1 nls_cp437 vfat fat usb_storage tun crypto_blkcipher aes_generic mperf cpufreq_powersave cpufreq_userspace cpufreq_stats i915 drm_kms_helper bnep rfcomm bluetooth crc16 fuse cryptomgr aead arc4 crypto_algapi rt2800usb rt2800lib crc_ccitt rt2x00usb rt2x00lib mac80211 cfg80211 uhci_hcd sg ehci_hcd tg3 sr_mod tifm_7xx1 usbcore cdrom coretemp snd_hda_codec_realtek pcmcia psmouse tifm_core i2c_i801 yenta_socket pcmcia_rsrc snd_hda_intel hwmon libphy snd_hda_codec usb_common pcmcia_core sdhci_pci sdhci mmc_core Pid: 1780, comm: Xorg Not tainted 3.5.2 #32 Call Trace: [<c1029958>] ? warn_slowpath_common+0x78/0xb0 [<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915] [<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915] [<c1029a23>] ? warn_slowpath_fmt+0x33/0x40 [<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915] [<f83c0d32>] ? i9xx_crtc_mode_set+0x942/0x1070 [i915] [<f83bd03b>] ? intel_crtc_mode_set+0x5b/0x90 [i915] [<f8118cd8>] ? drm_crtc_helper_set_mode+0x358/0x4a0 [drm_kms_helper] [<f83bf75c>] ? intel_get_load_detect_pipe+0x22c/0x3a0 [i915] [<c1096bb3>] ? find_get_page+0x63/0xc0 [<c10f572a>] ? __find_get_block+0x9a/0x1e0 [<c10f522a>] ? __find_get_block_slow+0xfa/0x1b0 [<f83d7d9b>] ? intel_tv_detect+0x9b/0x460 [i915] [<c10f588b>] ? __getblk+0x1b/0x320 [<c1215c01>] ? drm_get_connector_name+0x31/0x50 [<f8119cc8>] ? drm_helper_probe_single_connector_modes+0x258/0x310 [drm_kms_helper] [<c1215acf>] ? drm_mode_getconnector+0x33f/0x370 [<c1206223>] ? drm_ioctl+0x263/0x4e0 [<c1215790>] ? drm_mode_getencoder+0xb0/0xb0 [<c1205fc0>] ? drm_version+0x90/0x90 [<c10dbecd>] ? do_vfs_ioctl+0x8d/0x600 [<c11032d6>] ? fsnotify+0x196/0x2c0 [<c10e69c0>] ? alloc_fd+0xc0/0x110 [<c10cc8c7>] ? vfs_write+0x107/0x150 [<c10dc47d>] ? sys_ioctl+0x3d/0x70 [<c133c92c>] ? sysenter_do_call+0x12/0x22 ---[ end trace 81ff939dea1daea3 ]---
On 17.12.2012, Guennadi Liakhovetski wrote:
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state [drm:i915_reset] *ERROR* Failed to reset chip.
I have the same problem, are able to reproduce it and have bisected it, but the commit which git --bisect identified seems not to be the cause.
root@wildsau linux-git]# git bisect good 6c085a728cf000ac1865d66f8c9b52935558b328 is the first bad commit commit 6c085a728cf000ac1865d66f8c9b52935558b328 Author: Chris Wilson ch...@chris-wilson.co.uk Date: Mon Aug 20 11:40:46 2012 +0200
drm/i915: Track unbound pages
This is a quite nasty (3.7) regression. I have it on all of my three machines and it drives me mad (3.6.x hangs my USB 3.0 port and 3.7 my intel graphics).
Try to boot with "i915.i915_enable_rc6=0" and switch to SNA in your Xorg.conf:
Section "Device" Identifier "Card0" Driver "intel" Option "AccelMethod" "SNA" EndSection
There are tons of this "GPU hangcheck timer elapsed" messages on the net...
Good luck!
On Tue, Dec 18, 2012 at 12:15 AM, Heinz Diehl htd@fancy-poultry.org wrote:
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state [drm:i915_reset] *ERROR* Failed to reset chip.
I have the same problem, are able to reproduce it and have bisected it, but the commit which git --bisect identified seems not to be the cause.
root@wildsau linux-git]# git bisect good 6c085a728cf000ac1865d66f8c9b52935558b328 is the first bad commit commit 6c085a728cf000ac1865d66f8c9b52935558b328 Author: Chris Wilson ch...@chris-wilson.co.uk Date: Mon Aug 20 11:40:46 2012 +0200
drm/i915: Track unbound pages
Note that Guennadi is running kernel 3.5 which does not contain that patch - generally there are billions of ways to hang your gpu, so every gpu hang needs it's own bug (until we've analyzed the error_state and triaged the bug taking other evidence into account). Also, this is on a different gpu generation, so even more likely that it's a different kind of hang.
Thanks, Daniel
On Mon, Dec 17, 2012 at 11:36 PM, Guennadi Liakhovetski g.liakhovetski@gmx.de wrote:
Sorry, not sure what information is most appropriate here. GPU hangs from time to time on this laptop, typically when running firefox on graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6 (squeeze), lspci
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller]) Subsystem: Acer Incorporated [ALI] Device 011f Flags: bus master, fast devsel, latency 0, IRQ 46 Memory at fc000000 (64-bit, non-prefetchable) [size=1M] Memory at d0000000 (64-bit, prefetchable) [size=256M] I/O ports at 1800 [size=8] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 3 Kernel driver in use: i915
kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the X-server freezes and restarts with no errors in dmesg. Is it a known problem?
Upgrading to 3.7 highly recommended, if just to get the recent pile of gpu reset fixes - those should increase the chances that the reset works and your system survives. Also worth testing is upgrading userspace gfx driver components to the latest versions (mesa + xf86-video-intel). If you still have gpu hangs with all that, please file a bug report against DRM -> DRI/Intel and please attach the i915_error_state from debugfs after your gpu hung).
Yours, Daniel
Hi Daniel
Sorry for a late reply.
On Tue, 18 Dec 2012, Daniel Vetter wrote:
On Mon, Dec 17, 2012 at 11:36 PM, Guennadi Liakhovetski g.liakhovetski@gmx.de wrote:
Sorry, not sure what information is most appropriate here. GPU hangs from time to time on this laptop, typically when running firefox on graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6 (squeeze), lspci
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller]) Subsystem: Acer Incorporated [ALI] Device 011f Flags: bus master, fast devsel, latency 0, IRQ 46 Memory at fc000000 (64-bit, non-prefetchable) [size=1M] Memory at d0000000 (64-bit, prefetchable) [size=256M] I/O ports at 1800 [size=8] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 3 Kernel driver in use: i915
kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the X-server freezes and restarts with no errors in dmesg. Is it a known problem?
Upgrading to 3.7 highly recommended, if just to get the recent pile of gpu reset fixes - those should increase the chances that the reset works and your system survives.
Did that and it did work for a while, longer than the average with 3.5. I was already about to write a success report, but then it hung again yesterday. I'm not using this laptop very intensively, so, it is hard to collect statistics.
Also worth testing is upgrading userspace gfx driver components to the latest versions (mesa + xf86-video-intel).
Not sure I'll be bothered - recompiling user-space is too much of a hastle ;-) Maybe I'll just upgrade my distro altogether. Thanks for the tips anyway!
If you still have gpu hangs with all that, please file a bug report against DRM -> DRI/Intel and please attach the i915_error_state from debugfs after your gpu hung).
Yours, Daniel
Thanks Guennadi --- Guennadi Liakhovetski, Ph.D. Freelance Open-Source Software Developer http://www.open-technology.de/
On 30.12.2012, Guennadi Liakhovetski wrote:
Did that and it did work for a while, longer than the average with 3.5. I was already about to write a success report, but then it hung again yesterday. I'm not using this laptop very intensively, so, it is hard to collect statistics.
You could try to reproduce the error by writing a big file e.g.
dd if=/dev/zero of=deleteme bs=1M count=80000
or similar and watching high definition video on Youtube (1080p) or running a few instances of glxgears. That triggers a gpu hang in my case after just a couple of seconds.
In my case, the hang doesn't occur when using SNA (or a kernel < 3.7, which isn't the case with your bug). I have this in my xorg.conf:
Section "Device" Identifier "Card0" Driver "intel" Option "AccelMethod" "SNA" EndSection
Without this, every 3.7 kernel produces a gpu hang within max. 1 min.
dri-devel@lists.freedesktop.org