EVGA

UNSTABLE MACHINE - Dunno why

Author
Drazhar
CLASSIFIED Member
  • Total Posts : 2370
  • Reward points : 0
  • Joined: 2008/12/07 02:14:51
  • Status: offline
  • Ribbons : 4
2013/05/30 23:11:59 (permalink)
So I haven't folded super actively in a few days because well... the AC in the house is broken OK?

Try to get a WU finished, POW, unstable
 
Starts a new one, folds for over an hour, temps not pushing past 75 I believe, UNSTABLE
 
I'm running the Precision X GPU test and not seeing any artifacts, but I know my NV drivers crashed earlier today playing Need For Speed World... a bit concerned.
 
What do I do folding brethren?


 
#1

22 Replies Related Threads

    Panther-X
    New Member
    • Total Posts : 54
    • Reward points : 0
    • Joined: 2013/05/08 07:19:03
    • Status: offline
    • Ribbons : 1
    Re:UNSTABLE MACHINE - Dunno why 2013/05/30 23:48:34 (permalink)
    If you are folding P8900 (FahCore_17), I would recommend that you lower the OC even if it is factory one. The reason is that this Project is the largest ever on the GPU so is stressful when compared to other projects. During the testing, we found out that even factory overclocked GPUs were unstable and WUs would only complete successfully once the clocks were lowered to AMD/Nvidia Stock.
     
    Since there is a crash in the game, it could be a once only glitch or it could be an indication that the overclock is unstable. I would recommend that you run some benchmarking software to stress the GPU to find the stable overclock.

    #2
    Xavier Zepherious
    CLASSIFIED ULTRA Member
    • Total Posts : 6746
    • Reward points : 0
    • Joined: 2010/07/04 12:53:39
    • Location: Medicine Hat ,Alberta, Canada
    • Status: offline
    • Ribbons : 16
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 00:02:30 (permalink)
    if you bothered checking his mod rigs Panther-X...he's running a stock 570 (no OC)
     
    what we need to see is the actual log file portions to make any decisions on whats going on (ie the start of the WU and where the error occurred for that particular unit....like WU info, core info)
     
    Im more inclined to believe that core 17 isn't quite that stable yet, or the driver needs improving...which is why it's failing on previous stable cards...core 17 is beta so are any WU's for it
     

     


    Primes found     Affiliate Code:YN2AHK39LH



     
    #3
    Drazhar
    CLASSIFIED Member
    • Total Posts : 2370
    • Reward points : 0
    • Joined: 2008/12/07 02:14:51
    • Status: offline
    • Ribbons : 4
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 00:04:15 (permalink)
    Projects were 7626 and 8074, pretty sure my card is stock since I haven't messed with it and didn't buy an SC or higher version.

    I'll check around though.


     
    #4
    Xavier Zepherious
    CLASSIFIED ULTRA Member
    • Total Posts : 6746
    • Reward points : 0
    • Joined: 2010/07/04 12:53:39
    • Location: Medicine Hat ,Alberta, Canada
    • Status: offline
    • Ribbons : 16
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 00:29:00 (permalink)
    if you're running beta NV drivers go back to stable whql ones
     
    http://forums.evga.com/tm.aspx?m=1937207
    new drivers got released in last week - which could be the issue
     
    post edited by Xavier Zepherious - 2013/05/31 00:32:07


    Primes found     Affiliate Code:YN2AHK39LH



     
    #5
    wrinvert
    R.I.P Friend
    • Total Posts : 3751
    • Reward points : 0
    • Joined: 2007/05/01 19:32:12
    • Location: lost forever
    • Status: offline
    • Ribbons : 17
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 01:37:25 (permalink)
    i'm running 3 680's on drivers 320.00 with 100+ O/C on FTW's(evga o/ced) and didn't have a 1 fail on me in 2 days.


     
    #6
    Panther-X
    New Member
    • Total Posts : 54
    • Reward points : 0
    • Joined: 2013/05/08 07:19:03
    • Status: offline
    • Ribbons : 1
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 02:27:11 (permalink)
    Xavier Zepherious

    if you bothered checking his mod rigs Panther-X...he's running a stock 570 (no OC)
    ...

    Please note that while I saw the signature, I wasn't aware that "Mods Rigs" was an actual URL link to the system specification. Never clicked on it before nor knew about it until now.
     
    Given general description, the suggestions too were general. However, we now know that the issue is related to FahCore_15 since Drazhar has posted the Projects that crashed. It would be help if the log was posted as Xavier Zepherious suggested. Have you tried a clean installation of the latest WHQL Driver? Moreover, is there any dust build up in your PC?

    #7
    Viper97
    CLASSIFIED ULTRA Member
    • Total Posts : 5208
    • Reward points : 0
    • Joined: 2007/09/07 13:06:18
    • Location: Chillin'
    • Status: offline
    • Ribbons : 11
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 05:08:40 (permalink)
    My stock 670's with no OC handle the 8900's just fine.  I've been beating them up for the last 30 hours.  Temps here are 60C with 80% fan speed.  Mind you it's not as warm here in the Pacific Northwest as it is in some parts of the world.  So I'm enjoying a bit more folding time.
     
    That said, it's possible if you do down clock the GPU and it works you could have a marginal GPU.  Folding will sure pull the marginal hardware out of a line up and expose it for everyone to see.
     
     


     
    #8
    TheWolf
    CLASSIFIED Member
    • Total Posts : 3800
    • Reward points : 0
    • Joined: 2007/11/14 16:05:23
    • Location: Moss Point, Ms
    • Status: offline
    • Ribbons : 9
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 07:15:50 (permalink)
    Its a known fact that some Nvidia drivers stop responding when doing so the video card card can become down clocked among other things when this happens, these things cause FAH to crash.
     
    Do a video driver clean and fresh install of the latest known good driver. I'm using 314.22 without problems.

    EVGA Affiliate Code ZHKWRJB9D4 My HeatWare 
     
    #9
    TheWolf
    CLASSIFIED Member
    • Total Posts : 3800
    • Reward points : 0
    • Joined: 2007/11/14 16:05:23
    • Location: Moss Point, Ms
    • Status: offline
    • Ribbons : 9
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 07:29:31 (permalink)
    Panther-X 

    The reason is that this Project is the largest ever on the GPU so is stressful when compared to other projects.


    There for it should have a better point value and even better QRB than any other GPU WU.
    Can't wait to see what the real numbers will be for projects like this once there released to general public.

    EVGA Affiliate Code ZHKWRJB9D4 My HeatWare 
     
    #10
    Panther-X
    New Member
    • Total Posts : 54
    • Reward points : 0
    • Joined: 2013/05/08 07:19:03
    • Status: offline
    • Ribbons : 1
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 08:17:38 (permalink)
    From what I know, TJ Lane handles the points while Proteneer does the coding. Considering that this is a real project, it does open a huge potential for the future.
     
    REMOVED
     
    FahCore_17 does change the dynamics of points. Hopefully,  it will have a positive impact on the largest possible donor base with very little negative impact.
     
    BTW, 100 ns/day Explicit has been achieved in lab on GTX Titan with CUDA  All we have to do is to wait (im)patiently for Nvidia to release JIT and see our GPUs (and PPD) pushed to the limits
     
    EDIT -> Sorry for the miscommunication and any inconvenience caused. That wasn't my intention.
    post edited by Panther-X - 2013/05/31 20:05:17

    #11
    TheWolf
    CLASSIFIED Member
    • Total Posts : 3800
    • Reward points : 0
    • Joined: 2007/11/14 16:05:23
    • Location: Moss Point, Ms
    • Status: offline
    • Ribbons : 9
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 11:57:13 (permalink)
    That's what I'm talking about!! Bring on the bigadv GPU Wu's with bigadv points and k-factor/QRB.
    Panther-X

    From what I know, TJ Lane handles the points while Proteneer does the coding. Considering that this is a real project, it does open a huge potential for the future.

    Just for testing purposes, with bigadv points, you get this on the GPU with P7663 (special testing WUs):
    GTX 660 Ti OC -> 00:00:11 - 2,262,109.09 PPD
    GTX 650 OC -> 00:00:30 - 502,250.80 PPD
    GTX 675M Stock -> 00:00:31 - 478,145.40 PPD

    FahCore_17 does change the dynamics of points. Hopefully, it will have a positive impact on the largest possible donor base with very little negative impact.

    BTW, 100 ns/day Explicit has been achieved in lab on GTX Titan with CUDA All we have to do is to wait (im)patiently for Nvidia to release JIT and see our GPUs (and PPD) pushed to the limits


    post edited by TheWolf - 2013/05/31 11:59:20

    EVGA Affiliate Code ZHKWRJB9D4 My HeatWare 
     
    #12
    muziqaz
    New Member
    • Total Posts : 56
    • Reward points : 0
    • Joined: 2012/06/01 13:59:59
    • Status: offline
    • Ribbons : 0
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 12:26:06 (permalink)
    Panther, are you having a day off in here or are you just joking?
    Those numbers you posted about insane PPD are from internal testing with micro units. Units were decreased in size to catch certain bugs, but point estimate value of that unit was left from unit leaps and bounds bigger than micro unit. You do remember 7970s pulling 5-7 sec in TPF with estimated PPD of millions and millions. If it was April, I would laugh at it, but now it is not really funny, especially in this forum :)
    post edited by muziqaz - 2013/05/31 14:49:54

    3930k@4.1ghz
    x79 W T F
    16Gb of G.Skill 2133Mhz
    r9 290x
    OCZ Vertex 4 128Gb+5TB HDDs
    Win 7 64 Ultimate
    #13
    Grandpa_01
    New Member
    • Total Posts : 91
    • Reward points : 0
    • Joined: 2012/04/28 20:59:00
    • Status: offline
    • Ribbons : 0
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 13:54:42 (permalink)
    From what I have seen bigadv on a GPU is quite a ways off if it will ever happen, at least in the type of points being discussed. The current 8900 is the largest WU in existence for GPU's and it is the size of the common a4 WU that is a very long ways from a bigadv.
     
    I would not get my hopes up too much on seeing bigadv for GPU's in the near future there are still allot of steps that need to be taken, And I am not even sure that GPU's will be able to do them these WU's have actually chocked the GPU's pretty hard as far as being able to OC the cards and stability.
    #14
    Macaholic
    iCX Member
    • Total Posts : 338
    • Reward points : 0
    • Joined: 2008/08/18 18:07:59
    • Location: 1 Infinite Loop
    • Status: offline
    • Ribbons : 2
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 14:05:22 (permalink)
    Panther-X


    FahCore_17 does change the dynamics of points. Hopefully,  it will have a positive impact on the largest possible donor base with very little negative impact.

    BTW, 100 ns/day Explicit has been achieved in lab on GTX Titan with CUDA  All we have to do is to wait (im)patiently for Nvidia to release JIT and see our GPUs (and PPD) pushed to the limits



    Fold! It does a body good!™
    Mac Pro Dual Quad Core X5570 - 2 x Mac Pro Dual Quad Core X5365 - 2 x Asus Z8NA-D6C, Dual Hex Core E5645 - Supermicro H8QGi-F-O, Quad 12-Core Opteron 6174 - Supermicro H8QGi-F-O, Quad 16-Core Opteron 6272 - Tyan S8812, Quad 12-Core Opteron 6166HE - Supermicro H8QGi-F-O, Quad 12-Core Opteron 6164HE - 5 x EVGA Classified GeForce GTX 590


    #15
    Drazhar
    CLASSIFIED Member
    • Total Posts : 2370
    • Reward points : 0
    • Joined: 2008/12/07 02:14:51
    • Status: offline
    • Ribbons : 4
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 14:33:24 (permalink)
    So I went and installed 314.22 over my old 306.whateveritwas

    Test now, but the game response doesn't look promising. I'm guessing high clock issues.


     
    #16
    Drazhar
    CLASSIFIED Member
    • Total Posts : 2370
    • Reward points : 0
    • Joined: 2008/12/07 02:14:51
    • Status: offline
    • Ribbons : 4
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 15:19:19 (permalink)
    And nope, failed again.

    I'll call EVGA.


     
    #17
    Simba123
    CLASSIFIED Member
    • Total Posts : 2844
    • Reward points : 0
    • Joined: 2011/05/10 23:15:21
    • Location: Australia
    • Status: offline
    • Ribbons : 14
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 16:17:18 (permalink)
    Try doing a clean install of the nVidia drivers instead of an upgrade.
     
    http://www.nvidia.com/object/IO_13955.html
     
     


     
     
      
                                   
     
    #18
    Panther-X
    New Member
    • Total Posts : 54
    • Reward points : 0
    • Joined: 2013/05/08 07:19:03
    • Status: offline
    • Ribbons : 1
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 18:18:00 (permalink)
    muziqaz

    Panther, are you having a day off in here or are you just joking?
    ...

    I wasn't having an off day, nor joking. The point that I was making (which didn't come across as I expected) is that when and if bigadv comes to GPUs and they are using the bigadv benchmarking methods, high-end GPUs can make a significant amount of PPD. Now, I am not sure when or if it will happen, but I would say that the future is looking bright for GPU bigadv:

    21:27 < proteneer_osx> we might be able to do 500,000 to 1 million atoms
    ...
    21:27 < proteneer_osx> on the GPU
    ...
    23:25 <@proteneer> we've done 1 million attoms on a kepler before successfully
    Source -> http://sodaant.com/fah.log
     
    Sorry if my post caused any confusion as that wasn't my intention.

    #19
    codebluetoo
    iCX Member
    • Total Posts : 421
    • Reward points : 0
    • Joined: 2008/01/30 22:54:13
    • Location: Earthquake bunker, Vancouver Island, BC
    • Status: offline
    • Ribbons : 3
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 19:32:46 (permalink)
    Note that I too have had a couple of crashes the last couple of days.  No OC and normally my PC is very stable.  Seems to be fine over the last 12+ hrs.  I do have an I7


     
      Folding 4 Life 

        

      


    #20
    Quisarious
    New Member
    • Total Posts : 24
    • Reward points : 0
    • Joined: 2013/03/26 12:45:40
    • Status: offline
    • Ribbons : 0
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 19:37:22 (permalink)
    Panther-X

    muziqaz

    Panther, are you having a day off in here or are you just joking?
    ...

    I wasn't having an off day, nor joking. The point that I was making (which didn't come across as I expected) is that when and if bigadv comes to GPUs and they are using the bigadv benchmarking methods, high-end GPUs can make a significant amount of PPD. Now, I am not sure when or if it will happen, but I would say that the future is looking bright for GPU bigadv:

    21:27 < proteneer_osx> we might be able to do 500,000 to 1 million atoms
    ...
    21:27 < proteneer_osx> on the GPU
    ...
    23:25 <@proteneer> we've done 1 million attoms on a kepler before successfully
    Source -> http://sodaant.com/fah.log

    Sorry if my post caused any confusion as that wasn't my intention.

    Those quotes were also well before 8900 saw the light of day.  8900 is already taking a long time, and is considerably smaller than a lot of recent non-bigadv SMP units (i.e. 71xx lines from Kasson's lab), and 1/25 the size of current bigadv units.
     
    You can't really compare size directly, as difficulty depends at least as much on the types of calculations being performed (just look at the 810x bigadv line, exact same atom count with 8104 having tpfs 1/2 that of 8101), but by any overall measure, 8900 is not big, and yet it already takes a relatively long time to complete.  Extrapolate 8900 to 500k atoms and you're looking at a titan needing 4+ days to finish.
     
    But regardless, the points you listed are meaningless.  Not only was the WU completely fake, but just assigning the values from one project to another has no meaning.  You could apply bigadv settings to a uniprocessor WU and get similar ridiculous numbers.  Does that mean uniprocessors will soon be making 1mppd+?  Proteneer did it as a joke, not as any indication of what the core is capable of, or as an indication of the future of GPU folding.  
     
    Too many people are simply accepting as fact that gpus are always significantly faster than CPUs.  There is no doubt that for certain simulations that is true, but extrapolating to all cases is simply wrong.
     
    I've heard no indication from proteneer, or anyone else official, that the current beta ppd is off in any way.  Any argument against that (that the ppd should be higher), that is based on calculations from fahbench, 7663, previous GPU cores, or worst of all 8052, is faulty logic.  PPD seems right in line with what they want for kepler/AMD.  Fermi's don't do well on core_17, but that's not an indication that the WU settings are off, just that fermi's do not do well on this core.
    post edited by Quisarious - 2013/05/31 19:39:36
    #21
    Panther-X
    New Member
    • Total Posts : 54
    • Reward points : 0
    • Joined: 2013/05/08 07:19:03
    • Status: offline
    • Ribbons : 1
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 20:04:18 (permalink)
    Quisarious ...You can't really compare size directly, as difficulty depends at least as much on the types of calculations being performed (just look at the 810x bigadv line, exact same atom count with 8104 having tpfs 1/2 that of 8101), but by any overall measure, 8900 is not big, and yet it already takes a relatively long time to complete.  Extrapolate 8900 to 500k atoms and you're looking at a titan needing 4+ days to finish...

    With the correct hardware/software, this stands true. Things may or may not change in the future so any speculations about it won't do any good.

    Quisarious ...Proteneer did it as a joke, not as any indication of what the core is capable of, or as an indication of the future of GPU folding...

    Right. Sorry for miscommunication. I have edited my post to remove this.

    Quisarious ...There is no doubt that for certain simulations that is true, but extrapolating to all cases is simply wrong...

    Using the right tool (CPU or GPU) for the right job is important. It is the scientists and programmers area of expertise to find the right tool and ours to use what we can to help progress science further.


    #22
    widsss
    iCX Member
    • Total Posts : 432
    • Reward points : 0
    • Joined: 2011/03/07 09:56:20
    • Status: offline
    • Ribbons : 0
    Re:UNSTABLE MACHINE - Dunno why 2013/05/31 20:49:16 (permalink)
    The reason why people want bigadv gpu is that it appears the only way to get fair points for the work performed.
     
    All of this debate could quickly by resolved if benchmark machine TPFs were made public.

     
    Intel i7 3770K @ 4.5 Ghz. / Asus P67 Pro / 2 x EVGA GTX 980 SC / 16gb Corsair Vengeance RAM  / 2 x Crucial 64 gb. SSD / 1 Tb. Seagate / Corsair AX 850  / Corsair H100

    Intel i7 3930k @ 4.3 Ghz / Asus Rampage IV Extreme / 2 x EVGA GTX 980 Ti SC  /  32 gb Gskill Ripjaws RAM / 256 gb Vertex 4 SSD / Corsair AX1200i / Corsair H110i GT

    #23
    Jump to:
  • Back to Mobile