I've had two different 3090 FTW3 Ultra's, and they've each failed on me under similar circumstances within 24 hours after getting them. For the first card, my screen went black when gaming, followed by the card's fans getting very loud (I'm assuming thy were at 100%). My PC shutting off after a few minutes and I thought I smelled a slight burning smell. During any subsequent attempt to power on the PC, the RGB lights on the card wouldn't power on, a small red LED was on above the leftmost power connector, and no signal was sent to the monitor in spite of the OS booting up (verified by the program Multiplicity sending mouse inputs to my laptop through my LAN). ENGA gave me an advance RMA for this card, and to be safe, I installed a new power supply before I got the second card.
My second card failed in almost the same way when gaming. This time, the screen also went black, followed followed by the card's fans getting very loud. The PC didn't shut off until I forced a power off after ~10 minutes. I did not notice any burning smell this time, and the exhaust from the card kept getting cooler over time until it felt room temperature, making me think the card wasn't handling any workload anymore. Like with the first card, the card didn't send a signal to the monitor in subsequent attempts to power on the PC, though the PC booted into Windows (like with the first card). Unlike with the first card, the RGB lights on the card lit up, and none of the small LEDs above the three power connectors were lit up. I also hear a fan whining noise. My PC works fine when installing one of my previous graphics cards. I had used DDU before installing this card.
For the second card, I tried using another PCIE slot in the the PC, tried after disabling gsync, and briefly tried in a friend's borrowed PC, all with the same results. The integrated graphics on the friend's machine was able to connect to a monitor, and the device manager was unable to identify the 3090, even after scanning for hardware changes.
Both times, the failure occurred when playing the same game (GTA V with mods "ReShade 2.0.3 with SweetFX 2.0" and "ScriptHookV_1.0.2060.1" installed) after playing other, more intensive games without incident (Control, Minecraft RTX beta, Watch Dogs 3, RTX Quake II, etc.). I had also ran EVGA Precision X1 prior to each failure, which automatically updated the firmware for each card.
Any advice or similar experiences? I don't think this is a power supply issue because (1) the first power supply worked fine with two Titan X's (Pascal gen) in SLI, which have a combined power draw above the 3090, and (2) because I replaced the power supply with a new power supply. Each time, I was using 3 separate power cables for the three power connectors on the 3090's. One of the people from EVGA tech support suggested it may be due to the motherboard, perhaps because this card draws more power from the PCIE slot. I'll probably install a new motherboard when I'm able to buy a Ryzen 5950x or 5900x, but who knows how long that will be. They have agreed to do another RMA, but I'm nervous this third card might fail, and I'm not sure if I want to wait until I get one of the new Ryzen cards before I try a new 3090.
I went through the Event Viewer to see if I could find anything seemingly relevant and found these around the time of the second card's failure:
1:45:57pm
Display driver nvlddmkm stopped responding and has successfully recovered.
1:46:11pm
Winlogon in session 1 (console) reuqested session stop using GPU, returned status STATUS_SUCCESS, with progress stage of successful
1:46:11pm
Faulting application name: dwm.exe, version: 10.0.19041.508, time stamp: 0xcd97c98b
Faulting module name: KERNELBASE.dll, version: 10.0.19041.572, time stamp: 0x1183946c
Exception code: 0xe0464645
Fault offset: 0x000000000010b65c
Faulting process id: 0x28a0
Faulting application start time: 0x01d6afc6dd03cd4c
Faulting application path: C:\WINDOWS\system32\dwm.exe
Faulting module path: C:\WINDOWS\System32\KERNELBASE.dll
Report Id: 54a5eac1-0ffd-4877-8369-d09505f60fbb
Faulting package full name:
Faulting package-relative application ID:
1:46:53pm
Faulting application name: GTA5.exe, version: 1.0.2060.1, time stamp: 0x5f4d2237
Faulting module name: GTA5.exe, version: 1.0.2060.1, time stamp: 0x5f4d2237
Exception code: 0xc0000005
Fault offset: 0x00000000015ccc54
Faulting process id: 0x40b0
Faulting application start time: 0x01d6afc64133ff6c
Faulting application path: G:\I games\Rockstar Games\Grand Theft Auto V\GTA5.exe
Faulting module path: G:\I games\Rockstar Games\Grand Theft Auto V\GTA5.exe
Report Id: 77e6e540-ac07-417d-b603-aa07dba38047
Faulting package full name:
Faulting package-relative application ID:
My system specs:
CPU: i7 8700k (OC @4.8 GHz all core turbo, hyperthreading disabled)
RAM: G.SKILL Ripjaws V Series 64GB (4 x 16GB) 288-Pin DDR4 SDRAM DDR4 3200 (PC4 25600) Desktop Memory Model F4-3200C14Q-64GVK
Motherboard: ASRock Fatal1ty Z370 Gaming K6 LGA 1151
Power supply (first failure): Thermaltake TOUGHPOWER GRAND 80 Plus Gold 1200W (TPG-1200MPCUS)
Power supply (second failure): Seasonic PRIME GX-1000 (SSR-1000GD)
UPS:
CyberPower CP1500AVRLCD Intelligent LCD UPS System, 1500VA/900WStorage: 1 SSD + 5 HDDs
Sound card: Asus Xonar DGX (installed in a PCIE x1 slot)
Display: PG27UQ (gsync enabled when failures occurred, HDR on during one failure and off on another)
EDIT: I've had my 3rd 3090 FTW3 Ultra for about a week now, and it's still running. I've avoided installing Precision X1 or playing Grand Theft Auto V with this card (I have no idea if either had anything to do with the failures of the first 2 cards, but I decided to wait until after I've beaten Cyberpunk to try either).
post edited by joshm60 - 2020/11/14 19:11:33