https://bugzilla.kernel.org/show_bug.cgi?id=60381
Summary: AMD Radeon 7770 Ghz edition Crash with DPM active Product: Drivers Version: 2.5 Kernel Version: 3.10.0-next-20130703 Platform: All OS/Version: Linux Tree: Mainline Status: NEW Severity: blocking Priority: P1 Component: Video(DRI - non Intel) AssignedTo: drivers_video-dri@kernel-bugs.osdl.org ReportedBy: jrch2k10@gmail.com Regression: No
Created an attachment (id=106731) --> (https://bugzilla.kernel.org/attachment.cgi?id=106731) dmidecode
with 1 monitor it works fine until X session but UVD freeze the system and the card never reclocks
multimonitor freeze the system the instant KMS try to load DVI + HDMI
with static PM it crashes if you try to change the profile
if you need any help to debug this issue plz provide instruction since the hangs disable everything including ssh
system info as attach
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #1 from rafael castillo jrch2k10@gmail.com 2013-07-04 00:38:04 --- Created an attachment (id=106741) --> (https://bugzilla.kernel.org/attachment.cgi?id=106741) glxinfo
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #2 from rafael castillo jrch2k10@gmail.com 2013-07-04 00:38:24 --- Created an attachment (id=106751) --> (https://bugzilla.kernel.org/attachment.cgi?id=106751) lspci
https://bugzilla.kernel.org/show_bug.cgi?id=60381
Alex Deucher alexdeucher@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |alexdeucher@gmail.com
--- Comment #3 from Alex Deucher alexdeucher@gmail.com 2013-07-04 01:01:07 --- Please attach you dmesg output and a copy of your vbios. To get a copy of your vbios:
(as root) (use lspci to get the bus id) cd /sys/bus/pci/devices/<pci bus id> echo 1 > rom cat rom > /tmp/vbios.rom echo 0 > rom
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #4 from rafael castillo jrch2k10@gmail.com 2013-07-04 01:11:51 --- thanks for your response, im attaching the vbios info but im unable to get an dmesg output in the moment of the crash since it kicks in kms load and hangs[keyboard keys blinking] and even ssh stops
and if i boot without hdmi[all clean up to here] and plug hdmi all freeze again the same way[UVD or reclock have the same issue]
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #5 from rafael castillo jrch2k10@gmail.com 2013-07-04 01:12:10 --- Created an attachment (id=106781) --> (https://bugzilla.kernel.org/attachment.cgi?id=106781) vbios.rom
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #6 from Michel Dänzer michel@daenzer.net 2013-07-04 08:20:06 --- (In reply to comment #0)
with static PM it crashes if you try to change the profile
FWIW, that might work better if you explicitly set the low profile first.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #7 from rafael castillo jrch2k10@gmail.com --- hi alex i see you posted some new patches, any of those i should test to verify the issue on this bug?
no pressure just curiosity and as always many thanks for your hard work
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #8 from rafael castillo jrch2k10@gmail.com --- well tested the latest changes in you drm-3.11-next and issue still there, just reporting
https://bugzilla.kernel.org/show_bug.cgi?id=60381
Arek Ruśniak arek.rusi@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |arek.rusi@gmail.com
--- Comment #9 from Arek Ruśniak arek.rusi@gmail.com --- Hi,
Rafael if you pluged only dvi, dpm changes power levels?
Alex, i'm not sure if i should commit new bug(i have HD7770 GHz Ed. from Asus+olny one dvi display). In my case i've got power_level = 0 (low) all the ime. It reclocks olny when i try with UVD but it kills my machine. Forces power levels works olny for "low" level in other case "auto"(default) or "high": "bash: echo: write error: Invalid argument". It doesn't matter if power state is set to 'performance' or 'balanced'.
Static PM works ok(UVD too)(with something similar to Michael's tip of course).
kernel: drm-next-3.11 or linux-next-20130712
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #10 from rafael castillo jrch2k10@gmail.com --- pretty much the same here
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #11 from Michel Dänzer michel@daenzer.net --- FWIW, I'm basically seeing the same problems with my 7770 card.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #12 from rafael castillo jrch2k10@gmail.com --- well i tried drm-fixes-3.11 branch late night very fast but i got to KDE using dpm=1 and UVD seemed to work but when i opened xonotic the GPU hard reset and killed the monitors but is getting closer ;)
today ill try to debug this issue with xonotic since i noted you made the output more verbose from my tablet ssh i was too tired last night
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #13 from Arek Ruśniak arek.rusi@gmail.com --- i can confirm this for 3d apps(lightsmark or unvanquished), but uvd still doesn't work for me.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #14 from Arek Ruśniak arek.rusi@gmail.com --- Created attachment 106895 --> https://bugzilla.kernel.org/attachment.cgi?id=106895&action=edit dmesg output - kernel drm-fixes-3.11
I don't try 3.11-rc1, but code for radeon should be the same i hope.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #15 from rafael castillo jrch2k10@gmail.com --- im adding my dmesg for drm-fixes-3.11 just in case
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #16 from rafael castillo jrch2k10@gmail.com --- Created attachment 106897 --> https://bugzilla.kernel.org/attachment.cgi?id=106897&action=edit dmesg output
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #17 from Alex Deucher alexdeucher@gmail.com --- Created attachment 106898 --> https://bugzilla.kernel.org/attachment.cgi?id=106898&action=edit debugging output
Can you attach a dmesg output with dpm enabled with this patch?
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #18 from rafael castillo jrch2k10@gmail.com --- I can`t reach X with that patch applied, ill try later with gcc 4.7 just to be sure
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #19 from rafael castillo jrch2k10@gmail.com --- well compiling with 4.7 series i can reach X, i guess ill get another fun debug for later, atacched dmesg
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #20 from rafael castillo jrch2k10@gmail.com --- Created attachment 106900 --> https://bugzilla.kernel.org/attachment.cgi?id=106900&action=edit dmesg with patch applied
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #21 from Arek Ruśniak arek.rusi@gmail.com --- Created attachment 106902 --> https://bugzilla.kernel.org/attachment.cgi?id=106902&action=edit dmesg drm-fixes-3.11+patch
It almost looks like the same as before. But You are dev here:) I'll try uvd with this patch.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #22 from rafael castillo jrch2k10@gmail.com --- Created attachment 106917 --> https://bugzilla.kernel.org/attachment.cgi?id=106917&action=edit latest drm-fixes-3.11 dmesg
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #23 from rafael castillo jrch2k10@gmail.com --- ok xonotic still crash play hell with the GPU but now it can resume after failure and for things like gpu accel in browser or normal kwin usage seems stable enough.
it seems only real 3d apps like games trigger the crash
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #24 from Alex Deucher alexdeucher@gmail.com --- You might try the latest drm-fixes branch if you were using gcc 4.8. See: https://bugs.freedesktop.org/show_bug.cgi?id=66932
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #25 from rafael castillo jrch2k10@gmail.com --- well i tried drm-fixes-3.11 with both gcc 4.8.1 and 4.7.2 and both reset the GPU when reclock, i attached the dmesg.
i meant that with your recent changes the GPU recover and allow you to close the game or in the case of chrome fallback to cpu rendering instead of hardlock the system as before, so it got better but reclock still fails.
this Cape Verde XT chips seems to be a really problematic generation or maybe is that this chip come overclocked from factory maybe
https://bugzilla.kernel.org/show_bug.cgi?id=60381
Arek Ruśniak arek.rusi@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Attachment #106895|0 |1 is obsolete| | Attachment #106902|0 |1 is obsolete| |
--- Comment #26 from Arek Ruśniak arek.rusi@gmail.com --- Created attachment 106938 --> https://bugzilla.kernel.org/attachment.cgi?id=106938&action=edit boot with radeon.aspm=1
No change with 3d apps.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #27 from Arek Ruśniak arek.rusi@gmail.com --- Created attachment 106939 --> https://bugzilla.kernel.org/attachment.cgi?id=106939&action=edit boot with radeon.aspm=0
aspm=0 didn't help, 3d apps hang my pc. But finally UVD is working, doesn't mater with or without aspm.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #28 from rafael castillo jrch2k10@gmail.com --- Created attachment 106992 --> https://bugzilla.kernel.org/attachment.cgi?id=106992&action=edit Dmesg crash output for 3.11-rc2
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #29 from rafael castillo jrch2k10@gmail.com --- posted updated crash dmesg with kernel 3.11-rc2 in case it helps
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #30 from rafael castillo jrch2k10@gmail.com --- ok i tried the latest patches in drm-fixes and the crashes seemed to stop but i can't get the gpu to reclock
i tried /sys/class/drm/card0/device/power_dpm_state to performance alone and it never scales from state 0
if i try to force it with /sys/class/drm/card0/device/power_dpm_force_performance_level only accepts low
auto or high returns bash: echo: write error: Invalid argument
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #31 from rafael castillo jrch2k10@gmail.com --- uvd reclocks fine but the desktop flicker when it does
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #32 from Alex Deucher alexdeucher@gmail.com --- The dynamic re-clocking doesn't currently work reliably on SI asics.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #33 from rafael castillo jrch2k10@gmail.com --- well step by step is getting better, im happy enough to get stable desktop now and since my card render KDE like an monster reclocking is not uber important for me right now.
thanks for an awesome job ;)
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #34 from rafael castillo jrch2k10@gmail.com --- tested with today drm-fixes patches and its reclocking like a boss and xonotic passed from 30 FPS to an massive 190FPS in ultimate at 1366x768. i read you need some fixes for other part of asic for later so is up to you if you wish to close the bug report.
again a hundred bazillions thanks this is just awesome now
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #35 from Arek Ruśniak arek.rusi@gmail.com --- no pain no gain. Now everything works fast as hell. Even UVD is fliker-free now. Thanks Alex, best regards to you and radeon team.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #36 from Alex Deucher alexdeucher@gmail.com --- I guess this bug can be closed now?
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #37 from rafael castillo jrch2k10@gmail.com --- i guess yes, the only issue i find after this, is that KMS hang if you compile the kernel with radeon kms with Y instead of M
i can't findout why since is too early to see anything
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #38 from Alex Deucher alexdeucher@gmail.com --- (In reply to rafael castillo from comment #37)
i guess yes, the only issue i find after this, is that KMS hang if you compile the kernel with radeon kms with Y instead of M
i can't findout why since is too early to see anything
If you build the driver into the kernel, you also need to build the ucode into the kernel. I suspect the hang it due to missing ucode in the kernel image.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #39 from rafael castillo jrch2k10@gmail.com --- yeap make sense, i thought i did but is very probable im missing a step or two in the process, i tried just cuz i wanted to see if KMS could start earlier in the boot process since my PC with systemd but too fast and i can't even see kmscon kicking in because once the module load kdm is there. anyway as this bug is concerned all is peachy and since im using drm-3.12-next it got even better in some spots.
https://bugzilla.kernel.org/show_bug.cgi?id=60381
rafael castillo jrch2k10@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |CODE_FIX
https://bugzilla.kernel.org/show_bug.cgi?id=60381
--- Comment #40 from rafael castillo jrch2k10@gmail.com --- many thanks for your time and some nice piece of awesome work
dri-devel@lists.freedesktop.org