Specifications:
New Computer:
- Motherboard: ROG z790-i gaming
- CPU: 13900k
- RAM: G.Skill Trident Z5 64gb XMP
- PSU: Silverstone SX1000
- GPU: EVGA 3070 FTW3 ULTRA
Old Computer:
- Motherboard: MSI Z87I GAMING AC
- CPU: 4790k
- RAM: Corsair Vengeance Pro Series 16GB
- PSU: Silverstone SX700-G
- GPU: EVGA 780ti
Story:
I Built a pc january of 2023 being "new computer". New computer was working well with a second hand 3070, and I ran this system with no issues for a couple months. Gaming was rock solid and anything I would throw at it was no feat, that was until I started to install new nvidia drivers... This is when I started to experience the intermittent flashing. It would flash to black a couple time and for a few seconds and then it would continue running as usual. I had noticed that this flashing was more common when I was playing a game like fornite, but it also occured when I would simply be browsing.
I tolerated the issue for a couple months as it happened intermittently and only lasted a few seconds so it was sorta bearable and everything worked fine besides the black flashing.
At some point, it was just happening more often and I had to find a fix. Because it wasn't happening on the older drivers I had originally started with, I decided to use those drivers. The older drivers did fix the intermittent flashing, but there was still some weird behavior with the screen flashing black. It was flashing horizontally for a split second on 1/6 of the upper part of the screen. I thought it was maybe just my monitor acting up so I just ignored it and continued as usual for a couple more weeks.
Anyways, at some point I decided to upgrade my drivers again thinking the newer drivers being released would fix my issue. But when I updated to the latest drivers the issue came back and once again intermittently flashing to black. At this point I was annoyed the flashing would now last up to 10 seconds in a game and it was starting to become a burden. Because all the issues I experienced seemed to be GPU/Image related, I started to believed it was a GPU issue. I contacted EVGA and asked them if they could RMA this card and they were very helpful and provided me an RMA 3070.
When I installed the RMA 3070 everything worked smooth for about 2 weeks, until one day it just started flashing again. This time however something new was happening, it was artifacting then flashing to black. I was really concerned with the new symptom, it made me believed that there was now a critical problem with my system.
Before I tried to trouble shoot any hardware, I first tried to do all the basic software troubleshooting I could possibly do. DDU, Install new drivers, reinstall drivers, update bios, fresh windows install, test on linux. But regardless of all the troubleshooting I did, the problem still persisted and began to get worse over time.
Since software troubleshooting did not solve my issue, I moved on to testing all my hardware. I wasn't sure were to start, but since all the issues were GPU related and this was a new RMA, I refused to believe there was an issue with the GPU. I instead attributed the issues to it being a bad PCIe slot on the motherboard or possibly even the riser card included in my case.
I RMA'd the Motherboard and purchased a new riser card. Once I had the RMA motherboard, I installed the motherboard as well as the new riser card, but to my surprise the issue was still there...
A New motherboard and a new riser card did not fix my issue. Since this didn't fix the issue all I had left was to replace the CPU and RAM. I RMA'd the CPU and bought ram to replace the current one I had. But even when I swapped the CPU and RAM the problem was still there...
My pc was almost all new at this point the only part I hadn't replaced was the PSU. In a last ditch effort I tried to troubleshoot my system again, this time doing stress test's, mem test, any possible test i could think of. I even thought it was maybe my monitor or the cables. I ended up purchasing new media cables and tested the system on monitors, but once again the issue persisted.
I Began to dig a bit deeper and started to use the Event Viewer to get a hint at what was going on. The logs on event viewer did give me some details as to what was possibly happening when the screens started to artifact and go black.
Event ID 14 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.If the event originated on another computer, the display information had to be saved with the event.The following information was included with the event: \Device\000000af0000(0000) 00000000 00000000The message resource is present but the message was not found in the message table
The description for Event ID 0 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.If the event originated on another computer, the display information had to be saved with the event.The following information was included with the event: \Device\000000afUCodeReset TDR occurred on GPUID:100The message resource is present but the message was not found in the message table
3.The description for Event ID 0 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.If the event originated on another computer, the display information had to be saved with the event.The following information was included with the event: \Device\000000afError occurred on GPUID: 100The message resource is present but the message was not found in the message table
I would get hundreds of these errors in my event viewer. I had researched and googled these errors and found some posts with people experience similar issues and there were dozens of possible fixes and suggestions which i tried, powermanagement, pcie3 or 4, and many more that I can't remember at the momemnt.
Eventually I gave up, I couldn't figure out what was going on. I didn't want to accept that EVGA had possibly sent me a bad GPU, which could be possible. I contacted EVGA again,provided them with the logs and images and videos of the symptoms. I as well explained everything I had done to fix the issue. EVGA fortunately agreed to possibly being a GPU issue and sent me a third RMA GPU.
After some waiting the new RMA was here. I was so excited hoping that the new gpu would be an end to all my problems, but boy was I so wrong.
I installed the new RMA GPU and immediate artifacting and now BSOD crashing!(TDR error as the logs state). At this point I had not freaking clue what was going on I can't express how devastated I was. I had already gone through so much stress and sleepless nights trying to fix this issue only to have the 3rd RMA 3070 to fail on this system. I did not understand everything was basically new on this PC besides the PSU.
Since I was Running out of Ideas, I dug out my old pc out of storage. The specs on this old pc are listed above.
Since the only part I hadn't swapped out yet was the PSU I started with that. I swapped PSU's, but the problem was still there... However, now it was much much worse at this point the GPU was Artifacting on boot and stayed artifacting in the OS and also the OS stopped recognizing the card unable to open the nvidia settings saying there is no nvidia gpu plugged in and when i would launch a game it would say no compatible d11 card was detected.
Since I did an advance RMA I still had access to the first RMA, I took advantage and began testing both GPU's on my system and the problems were present on both dang cards.
I even tested the card on the old PC and it present really different issues. On the old pc on one card it would artifact for a split second and not boot. The other it would boot but the driver never initiated and would be in the lowest resolution but there was any artifacting. The old pc had an old 780ti so I thought let me try that on my new pc and it worked. It booted with no issues was recognized by the system with no issue I was able to installed the 780ti drivers with no issue and I was even able to boot into a game without an issue! Being in disbelief that an old 780 that was dusty and beat to hell, i say this because I accidentally dropped it years ago and put a dent on the pcb, it was still able to function and even play games with out an issue. I had to make sure the 780ti was going to hold out and survive so I left a game running for over 12 hours and I still have it installed on my system 48 hours late and there has not been a single issue arise.
At this point I don't know what else to do I've been through so much stress and gone sleepless night over this damn issue. There is only one theory i could think of, because I still hadn't swapped out the PSU I was thinking maybe the PSU was causing some sort of electrical issue causing damage to the GPU's but the 780ti has been running fine with same the PSU. Regardless I'm currently in the process of RMA'ing the PSU and currently waiting for a replacement. Because I'm paranoid and suspecting electrical issue I also bought a UPS with power conditioning to help maybe smooth out what every "dirty" electricity that is being supplied.
I'm extrememly depressed I wasn't able to find a solution to my problem and I'm just waiting on EVGA to respond to my tickets I sent about this issue. At this point I don't know what else to do other than to buy another card and different brand and model. I've already spent so much money trying to fix this issue though and coughing up $500+ for a new GPU really sucks.
That's my story with this issue. I'm willing to hear any opinions on as to what could've possible happened. Thank you for reading.
Symptoms:
- GPU Intermittently Artifacting / black screening / or BSOD.
- IMAGE 1
- IMAGE 2
- VIDEO 1
- VIDEO 2
- Occurs when browsing firefox or playing game intermittently.
- Started occurring when I began updating Nvidia drivers.
- Produces various event error in "Event Viewer"
- GPU Intermittently Artifacting / black screening and BSOD on two computer with different hardware
Troubleshooting attempts:
- RMA GPU 2 times
- RMA mother board and replaced riser card
- RMA CPU
- Purchased new ram
- Updated all drivers and bios and anything that could be updated
- tested gpu on two different computers with the same results
- TESTING CARDS VIDEO 1
- TESTING CARDS VIDEO 2
- windows fresh install and tested with fedora linux with the same result
- trying older 780ti gpu on new build with out an issue.
- 780TI NO ISSUE RUNNING VIDEO
- tried different media cables, purchased differe display ports and HDMI cables.
- tried a variety of monitors and now even have an RMA on my monitor because I thought that was a cause as well but it literally happens on all of monitors, I have 6 different monitors.
I just want to use my computer in peace without the fear of an issue arising.
EDIT: IT SEEMS LIKE MY LINKS TO MY IMAGES AND VIDEOS WERE REMOVED. I'M ASSUMING IT IS NOT ALLOWED. ANYONE KNOW A WAY I COULD STILL SHARE THE VIDEOS AND IMAGES?
ALSO SORRY FOR THE TYPOS, IT'S 2AM I'M TIRED.
post edited by kmachappy - 2023/07/04 05:26:56