https://bugs.freedesktop.org/show_bug.cgi?id=99316
Bug ID: 99316
Summary: Radeon crash when laptop on AC power
Product: DRI
Version: unspecified
Hardware: Other
OS: All
Status: NEW
Severity: major
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: LordSamanon(a)gmail.com
Created attachment 128807
--> https://bugs.freedesktop.org/attachment.cgi?id=128807&action=edit
AC Power Unigine Heaven dmesg
I have a Dell laptop with an Intel iGPU and a Radeon dGPU (a Firepro W5130M,
recognized as a Radeon HD 8830M by lspci).
Whenever I attempt to use the Radeon card via DRI_PRIME=1 while the laptop is
plugged in, the card crashes. When on battery the card performs fine.
One way I have tested this is with glxgears. Often glxgears will work the first
time it is run, but if it is closed and then opened a few seconds later, it
will crash.
I have also tried running programs like Portal 2 and Unigine Heaven. These
always hang. Unfortunately the error messages produced in dmesg are not the
most consistent. Sometimes I get a backtrace, sometimes even a kernel panic on
reboot.
I'll post the various logs I have. One thing I have found is that glxgears does
not crash if I use radeon.runpm=0, however it still hangs when running
something like Portal.
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=99181
Bug ID: 99181
Summary: RS780 blank screen on boot
Product: DRI
Version: XOrg git
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: chithanh(a)gentoo.org
Hardware: BIOSTAR A780L
When booting with 3.15 or newer kernels, a monitor connected to the DVI port of
the BIOSTAR A780L will stay blank.
Kernel 3.14 and older work fine.
I tried forcing the output to on with video=HDMI-A-1:e but this made no
difference.
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=99049
Bug ID: 99049
Summary: Machine freeze when clock are set to defaults
Product: DRI
Version: XOrg git
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: ls(a)maxux.net
My laptop contains theses cards:
----
00:02.0 VGA compatible controller: Intel Corporation HD Graphics 520 (rev 07)
01:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Sun XT
[Radeon HD 8670A/8670M/8690M / R5 M330] (rev 81)
----
Under Gentoo, I enabled radeon and radeonsi as videos cards, everything looks
fine. According to xrandr, I have providers:
----
Provider 0: id: 0x76 cap: 0xb, Source Output, Sink Output, Sink Offload crtcs:
4 outputs: 3 associated providers: 0 name:Intel
Provider 1: id: 0x4f cap: 0xd, Source Output, Source Offload, Sink Offload
crtcs: 0 outputs: 0 associated providers: 0 name:HAINAN @ pci:0000:01:00.0
----
I found on the internet that DPM could cause issue, I tried: radeon.runpm=0
radeon.dpm=0 (see below)
Using theses settings, I don't have any freeze when I try to use the Radeon
Card (using DRI_PRIME=1), but I found that everything was slow. I checked, and
I saw:
---
cat /sys/class/drm/card1/device/power_method
profile
cat /sys/class/drm/card1/device/power_profile
default
cat /sys/kernel/debug/dri/65/radeon_pm_info
default engine clock: 1070000 kHz
current engine clock: 299990 kHz
default memory clock: 900000 kHz
current memory clock: 298990 kHz
voltage: 1150 mV
PCIE lanes: 4
---
The card is running in low profile by default, don't know why. Setting
power_profile to high, mid or low doesn't change anything, but if I set
power_profile back to default again, the clock is set to full speed.
When clock is set to full speed, my system freeze if I try to run any 3D
application (glxgears or a game using wine). Here is the dmesg log:
---
radeon 0000:01:00.0: ring 0 stalled for more than 10436msec
radeon 0000:01:00.0: GPU lockup (current fence id 0x00000000000014f4 last fence
id 0x00000000000014f6 on ring 0)
radeon 0000:01:00.0: Saved 49 dwords of commands on ring 0.
radeon 0000:01:00.0: GPU softreset: 0x00000049
radeon 0000:01:00.0: GRBM_STATUS = 0xE5D04028
radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0xEE400000
radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000006
radeon 0000:01:00.0: SRBM_STATUS = 0x200000C0
radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00018000
radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00008000
radeon 0000:01:00.0: R_008680_CP_STAT = 0x80030243
radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
radeon 0000:01:00.0: R_00D834_DMA_STATUS_REG = 0x44C83D57
radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
radeon 0000:01:00.0: GRBM_SOFT_RESET=0x0000DDFF
radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
radeon 0000:01:00.0: GRBM_STATUS = 0x00003028
radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x00000006
radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000006
radeon 0000:01:00.0: SRBM_STATUS = 0x200000C0
radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00000000
radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00000000
radeon 0000:01:00.0: R_008680_CP_STAT = 0x00000000
radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
radeon 0000:01:00.0: R_00D834_DMA_STATUS_REG = 0x44C83D57
radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[drm] probing gen 2 caps for device 8086:9d10 = 1724843/e
[drm] PCIE gen 3 link speeds already enabled
[drm] PCIE GART of 2048M enabled (table at 0x0000000000040000).
radeon 0000:01:00.0: WB enabled
radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000100000c00 and
cpu addr 0xffff88046c66dc00
radeon 0000:01:00.0: fence driver on ring 1 use gpu addr 0x0000000100000c04 and
cpu addr 0xffff88046c66dc04
radeon 0000:01:00.0: fence driver on ring 2 use gpu addr 0x0000000100000c08 and
cpu addr 0xffff88046c66dc08
radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000100000c0c and
cpu addr 0xffff88046c66dc0c
radeon 0000:01:00.0: fence driver on ring 4 use gpu addr 0x0000000100000c10 and
cpu addr 0xffff88046c66dc10
[drm:r600_ring_test [radeon]] *ERROR* radeon: ring 0 test failed
(scratch(0x850C)=0xCAFEDEAD)
[drm:si_resume [radeon]] *ERROR* si startup failed on resume
---
I found out with theses steps that, if I don't set dpm=0, I hit exactly the
same issue, I guess using DPM the clock is set to high when the card is used
and it crash. When the card is stuck on that loop, I need to reset the machine
(but network still works).
I'm using mesa-13.0.2 with a 4.7.6 kernel, I have the same issue using mesa
12.0.1
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=98988
Bug ID: 98988
Summary: [Regression, bisected] New BONAIRE UVD firmware causes
DPM problems and extremely slow performance
Product: DRI
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: falaca(a)gmail.com
Created attachment 128327
--> https://bugs.freedesktop.org/attachment.cgi?id=128327&action=edit
Kernel bisect log
I have a 2GB Radeon R7 260X (BONAIRE).
With kernel 4.7 and above, I was experiencing extremely slow performance. Even
desktop animations on Ubuntu 16.04 w/ Unity desktop are extremely choppy,
probably about 10fps.
dmesg produces several instances of the following error message:
[drm:ci_dpm_set_power_state [radeon]] *ERROR* ci_upload_dpm_level_enable_mask
failed
I did a kernel bisect, and narrowed the problem to the following commit:
http://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/commit/…
The bisect log is attached.
It seems that the commit adds support for a new firmware file,
"bonaire_uvd.bin". If the driver fails in loading the new firmware file, it
falls back to the legacy file, "BONAIRE_uvd.bin".
To confirm that the issue is caused by the new firmware, I deleted
bonaire_uvd.bin, and performance is restored to normal with the latest stable
kernel (4.9.0-rc7).
For what it's worth, here are the contents of
/sys/kernel/debug/dri/64/radeon_pm_info while idling on the Ubuntu desktop with
the new firmware:
uvd disabled
vce disabled
power level avg sclk: 115774 mclk: 15000
And the old firmware:
uvd disabled
vce disabled
power level avg sclk: 30248 mclk: 165000
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=98987
Bug ID: 98987
Summary: [RS690] BUG: soft lockup when radeon modesetting
Product: DRI
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: critical
Priority: high
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: vaclav.ovsik(a)gmail.com
Created attachment 128323
--> https://bugs.freedesktop.org/attachment.cgi?id=128323&action=edit
logged using netconsole - kernel 4.8.7 (Debian kernel)
Dear developers,
I'm unable to boot into functional system with KMS radeon. When
radeon.modeset=0 on kernel command-line system it starts and run OK.
Without disabled modeset the kernel 4.7, 4.8, 4.9-rc7 with 90% probability
locks or crash.
I'm running Debian unstable and tried Debian kernels 4.7, 4.8 from Debian and
compile 4.9rc7 vanilla kernel.
Attached logs through netconsole.
Any idea what I can test?
Thanks
--
Zito
ref: debian bug #845382 -
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=845382
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=98410
Bug ID: 98410
Summary: Applications crash when exiting
Product: DRI
Version: XOrg git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: major
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: cismalescumlord(a)programmer.net
closing various applications leads to a crash. A couple of examples below. I
set DRI_PRIME=1 as that uses the Radeon card and not the Intel. My laptop runs
faster, cooler and quieter when I use this. Setting DRI_PRIME=0 does prevent
the crashes. Firefox, Thunderbird and PyCharm all run and close just fine with
DRI_PRIME=1.
kate
No docs r (or the only) opened right now --> disable menu
radeon: Failed to allocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: alignment : 4096 bytes
radeon: domains : 4
radeon: va : 0x0000000000800000
radeon: Failed to deallocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: va : 0x800000
radeon: Failed to allocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: alignment : 4096 bytes
radeon: domains : 4
radeon: va : 0x0000000000800000
radeon: Failed to deallocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: va : 0x800000
radeonsi: Failed to create a context.
No docs r (or the only) opened right now --> disable menu
We got some errors while running testparm "Load smb config files from
/etc/samba/smb.conf\nLoaded services file OK.\nWARNING: The 'netbios name' is
too long (max. 15 chars).\n\n"
0x1f61ce0 deleted without having been removed from the factory first. This
will leak standalone popupmenus and could lead to crashes.
KCrash: Application 'kate' crashing...
konsole
radeon: Failed to allocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: alignment : 4096 bytes
radeon: domains : 4
radeon: va : 0x0000000000800000
radeon: Failed to deallocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: va : 0x800000
radeon: Failed to allocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: alignment : 4096 bytes
radeon: domains : 4
radeon: va : 0x0000000000800000
radeon: Failed to deallocate virtual address for buffer:
radeon: size : 65536 bytes
radeon: va : 0x800000
radeonsi: Failed to create a context.
KCrash: Application 'konsole' crashing...
This works fine with Kernel 4.7
$ kate
No docs r (or the only) opened right now --> disable menu
No docs r (or the only) opened right now --> disable menu
We got some errors while running testparm "Load smb config files from
/etc/samba/smb.conf\nLoaded services file OK.\nWARNING: The 'netbios name' is
too long (max. 15 chars).\n\n"
0x1515460 deleted without having been removed from the factory first. This will
leak standalone popupmenus and could lead to crashes.
$ uname -a
Linux andromeda-ascendant 4.7.6-1-default #1 SMP PREEMPT Fri Sep 30 12:22:14
UTC 2016 (fb37fcc) x86_64 x86_64 x86_64 GNU/Linux
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=97896
Bug ID: 97896
Summary: RADEON DisplayPort - Monitor shows out of range in
some modes
Product: DRI
Version: unspecified
Hardware: x86 (IA32)
OS: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: jan.burgmeier(a)unicon-software.com
Created attachment 126733
--> https://bugs.freedesktop.org/attachment.cgi?id=126733&action=edit
dmesg output with drm.debug=0x1e log_buf_len=1M
Hi,
I have two monitors connected to my PC one via DVI and one via DP. The monitor
connected via DP shows "frequency out of range" when some modes including the
preferred one are used. This happens in X and in the KMS framebuffer. For
example on the attached logs the preferred mode of 1680x1050 does not work
where as 1680x945 works perfectly fine.
Kernel: 4.4.11
Xorg: 1.18.2
xf86-video-radeon: 7.7.0
Last working kernel: 3.19
First broken kernel: 4.0
Here is the bisect info, the 4.0 release segfaulted in the radeon driver but
the change which broke this was before so I hope the bisect is right:
# first bad commit: [e55bca26188e45f209597abf986c87cc5a49894a] radeon/audio:
enable DP audio
After finding the bad commit I tried booting with radeon.audio=0 this made the
display work for 3.19 kernel with that commit but did not work for the 4.4.11
kernel.
Attached log files:
- dmesg output with drm.debug=0x1e log_buf_len=1M
- lspci -vv output
- video card bios
- Xorg.0.log
- xrander --verbose output
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugs.freedesktop.org/show_bug.cgi?id=97856
Bug ID: 97856
Summary: Computer restart playing 3D games (possibly
overheating)
Product: DRI
Version: XOrg git
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: DRM/Radeon
Assignee: dri-devel(a)lists.freedesktop.org
Reporter: tukkek(a)gmail.com
Hello, sorry if I'm reporting this to the wrong product but the bug report
procedures on the wiki are pretty hard to understand
https://www.x.org/wiki/RadeonFeature/#index11h2
I have an onboard Radeon HD 3000 (ATI RS780L). I am primarily using Debian
testing ("stretch"). I believe the open source drivers for it are providade by
the following packages:
linux-image-4.6.0-1-amd64 (4.6.4-1, radeon.ko)
xserver-xorg-video-ati (1:7.7.0-1)
xserver-xorg-video-radeon (1:7.7.0-1, radeon_drv.so, due to update in a few
days)
The problem I'm having is that when playing 3D games the computer will randomly
crash and reboot after anywhere from 10 minutes into the game, to over an hour
of gameplay without rebooting (mostly around 10-30 minutes with a crash). It
seems to be heat-related because when it's cooler it seems the game has less
chance of crashing and when a crash happens and I try playing again after the
reboot is complete, the crash seems to happen more rapidly - maybe after 5
minutes or so playing.
The crash seems to be some sort of hardware failure because there are no trace
in the journald persistent logs that I can find. For this, I can't also be sure
what is happening. Let me know if there is something I can do to help debug
this.
To verity if the error was the driver's fault I installed a new Debian system
(oldstable, Debian 7 "wheezy"), which allowed me to install the fglrx legacy
driver with Radeon HD 3000 support. In this new system I haven't experienced a
single reboot so far - which establishes the cause isn't hardware-only related
but very likely a driver issue. Being an entirely new system means it could be
something else too but since I have very frequent crashes/reboots in my primary
system and none so far in the alternate system while running games for an hour
or so frequently on a hot day, would indicate that the fault is indeed coming
from the Radeon open-source criver.
I haven't done any 3D gaming in this computer before a couple of weeks from now
so I can't say that this bug only happens on recent driver versions or not.
Watching videos in a browser or in a video application (such as VLC) and 2D
games like http://littlewargame.com or rendering videos (via kdenlive or such)
do not cause reboots, even though they can be relatively heavy on the GPU. I
haven't had any random crashes in a very long while as well except when doing
3D gaming. I've installed a few games to test it out and whenever 3D gaming the
crashes do happen frequently. Some of the games I've used to test this are
Heroes of Newerth and Runescape (both free to download and play) and very
lightweigth in the low settings, so it shouldn't be a quesiton of me stressing
the card too much either (HoN for example works fine with the fglrx legacy
driver on my alternate system).
I have run memtest86, CPU and memory stress (stress-ng 0.06.15-1) in the hopes
of catching a spike in my machine's temperature as the culprit for these random
crashes but I've found the temperature to be stable and low even during heavy
load for a long time. I've ran the Geeks3D GpuTest, which puts a heavier load
than these games on the graphics card but haven't been able to cause a crash,
even though in this case my tests haven't been extensible - I can run them for
longer though if it would help debug the issue.
I undestand that there have been recent updates on the graphics drivers on the
new Linux kernel update. I will try the new drivers as they come out since it's
somewhat of a bother having to reboot the computer (and maintain a legacy
system) whenever I want to play 3D games and if the problem is solved I'll
report back here. If I don't comment on this issue in the near future it's
because the problem persists even with the new drivers.
My guess about what is happening: since the problem seems to be heat-related
maybe there is some sort of temperature sensor that the open source driver
isn't able to read on my card - which I was expecting to be able to see using
lmsensors (version 1:3.4.0-3) while maybe the fglrx is able to read and handle
heat properly.
I don't usually report bugs to trackers that already have many reports open but
since I've spent a long time in tracking this issue and was able to fix it, I
thought that I should share all the information I've gathered in the hope it's
useful. It's an older, onboard graphics card model, probably getting to the end
of its lifespan soon but I hope this report is valuable in some way, anyway.
Thank you for the good work on these open drivers. If I were able to use them
on my primary system I'd certainly do it, even if the fglrx legacy system is a
little bit smoother, since it would be a lot more convenient than maitaning a
separare gaming system in my machine. Thanks again for the contributions to the
FOSS community and for the time reading this report!
--
You are receiving this mail because:
You are the assignee for the bug.