Hot!SR-2 - 96GB Secret Sauce

Page: < 123 Showing page 3 of 3
Author
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/05 18:49:39 (permalink)
I was able to boot into ubuntu 16.4 with 96gb r2x4 hynix... windows will give me a bsod... any reset will boot loop and not post. Like earlier people, the start after the cmos clear works fine. Ubuntu runs good with it and it detects. I have e6520s so not the highend cpus but does have the 6 cores.

I am running at 1333mhz
Bios and ubuntu detects all 96gb
#61
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/05 18:52:57 (permalink)
I saw instructions on how to install vmware workstation on ubuntu... so i guess i dont need windows.

Maybe i will just install esxi host and have my vc implimented on my host ina nested environment.
#62
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/07/06 00:58:49 (permalink)
I never tried Windows, but it works fine for me on CentOS 6 and CentOS 7. There is a good chance you have a DIMM or CPU that isn't quite properly seated. If you decide to use ESXi, you should be aware that it is "headless", so you will need another machine to use as a workstation. PCI passthrough of GPUs will not work with the SR-2 and ESXi (even if you have Quadros). It can be made to work with Xen and KVM, but you will need QEMU 2.2 or later and limit the low (below 4GB) memory to something like 1GB to work around an IOMMU DMA routing bug caused by the Nvidia NF200 PCIe bridges. This bug will also make most SAS controllers (tried Adaptec, LSI and 3ware) either outright not work or cause a crash not long after starting VMs when IOMMU is activated (and you need IOMMU to use PCI passthrough).
 
Or to put it another way, the SR-2 is a very poor choice of motherboard for virtualization. I have my setup working quite stably, but it took me weeks to figure out and work around the bugs.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#63
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/07 02:25:18 (permalink)
gordan79
... There is a good chance you have a DIMM or CPU that isn't quite properly seated. If you decide to use ESXi, you should be aware that it is "headless", so you will need another machine to use as a workstation....
Or to put it another way, the SR-2 is a very poor choice of motherboard for virtualization. I have my setup working quite stably, but it took me weeks to figure out and work around the bugs.


Other than the initial start, this seems to work fully.
I doubt that anything is seated wrong. I have full functionality with ubuntu with 94.4gb of usable memory... assuming that the rest is lost in conversion. I currently am running workstation 12 on ubuntu. I have 6 hosts implimented and multiple windows machines on each host with FT implemented.
 
As for the stability of the board... no issue running this for a few days on full load... just that issue when resetting. I tried delaying the post so that the memory can initialize.
 
#64
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/07/07 02:33:33 (permalink)
Indeed, running with more than 48GB of RAM will make the initial POST intermittent, and it will _never_ POST on the first try. If you watch the diagnostic display, it will give up several times and retry before it finally succeeds. The only time it will POST first time is right after a CMOS reset. After it manages to POST, it will continue to work fine, and will POST fine after soft-resets. Hard resets will struggle to POST, same as cold-boots.
 
This is due to a BIOS bug that we are unlikely to ever see fixed.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#65
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/07 10:42:32 (permalink)
Something like that
#66
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/07 10:44:34 (permalink)
I wonder if they have a bois update to compensate for the larger ram chips... I believe the issues is bios related and not hardware.
#67
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/07 11:07:53 (permalink)

It looks like spaghetti. And as you can see my ram is being almost maxed out from my virtual machines... Currently running almost a week without failure.
 
post edited by takuhari - 2016/07/07 20:00:55
#68
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/07/08 00:08:54 (permalink)
Yes, it is a BIOS bug. No, it won't get fixed. SR-2 has long been out of production, and not even warranty replacements were available last time I checked (so much for that 10 year warranty). POST-ing with 96GB of RAM also isn't the only bug. There are much more trivial ones that are long standing and weren't fixed even back when BIOS updates were still being worked on.
 
If you want something that will just work, ebay the board to somebody who doesn't know better, and replace it with a similar Supermicro board. Top of the line X5690 CPUs are no longer that expensive, so the benefit of overclocking is negligible anyway. I've been planning to do this for a long time, but finding a day to waste on rebuilding a server isn't as easy.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#69
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/08 02:28:47 (permalink)
I think I will hang on to this one...
So far... other than the reset, I havn't had any problems or crash.
All of my VMs are in good standing, so it is working for what I need it for.
No need to sink money into something else.
I am running multiple windows systems and different flavors of unix. I never power the system down anywhos... lol
#70
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/07/10 00:22:54 (permalink)
One thing that will break spectacularly is PCI passthrough of devices with memory apertures. There is a very serious bug in the NF200 PCIE bridges that causes them to bypass the IOMMU on PCIE DMA transfers. Most SAS controllers will cause a lock-up or just won't work at all with the IOMMU enabled. If you ever passthrough GPUs to your VMs you will have to use special workarounds to prevent the VM memory from overlapping the PCI IOMEM regions or the whole machine will crash as soon as the VM uses virtual memory space that overlaps with the physical IOMEM regions on the host.
 
As you can see from my signature, I managed to get this working reliable, but in hindsight, the weeks I spent figuring out the bug and working around it were worth far more than the cost difference to a good server board and top of the line Xeons to compensate for the lack of OC-ing features.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#71
takuhari
ACX Member
  • Total Posts : 396
  • Reward points : 0
  • Joined: 2010/06/01 12:38:29
  • Location: Barstow CA 92311
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/11 23:39:47 (permalink)
wow... how did you get the reset to work without clearing the cmos?
I just never turn this thing off. Still no crash yet.
#72
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/07/12 00:47:20 (permalink)
Re-read the beginning of the thread. The key part is setting all the memory settings manually, and setting the memory command rate to 2T. That will make it complete the POST within 4-5 attempts most of the time.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#73
brunoluansm
New Member
  • Total Posts : 10
  • Reward points : 0
  • Joined: 2013/12/23 08:58:17
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/07/28 15:24:45 (permalink)
Great!
#74
skulstation2
Superclocked Member
  • Total Posts : 147
  • Reward points : 0
  • Joined: 2014/09/17 00:59:03
  • Status: offline
  • Ribbons : 1
Re: SR-2 - 96GB Secret Sauce 2016/10/06 10:41:10 (permalink)
i just joint the 96gb club.
all ram settings on auto.
only vcore ( 1,45v ) and vtt ( 1.35v ) ar not on auto.whit this settings the cpu can do +4,4ghz

 
 
#75
roswellian2002
New Member
  • Total Posts : 2
  • Reward points : 0
  • Joined: 2016/08/27 21:57:58
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/10/06 23:01:50 (permalink)
gordan79
One thing that will break spectacularly is PCI passthrough of devices with memory apertures. There is a very serious bug in the NF200 PCIE bridges that causes them to bypass the IOMMU on PCIE DMA transfers. Most SAS controllers will cause a lock-up or just won't work at all with the IOMMU enabled. If you ever passthrough GPUs to your VMs you will have to use special workarounds to prevent the VM memory from overlapping the PCI IOMEM regions or the whole machine will crash as soon as the VM uses virtual memory space that overlaps with the physical IOMEM regions on the host.
 
As you can see from my signature, I managed to get this working reliable, but in hindsight, the weeks I spent figuring out the bug and working around it were worth far more than the cost difference to a good server board and top of the line Xeons to compensate for the lack of OC-ing features.


I'm really happy to come across this post. I originally planned to get a used SR-2 for GPU passthrough on ESXi...After reading the posts all, it seems to be impossible. I still have one more question...what if I assign > 4G memory for a VM, will passthrough work? Thanks.
#76
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/10/10 05:16:18 (permalink)
roswellian2002
gordan79
One thing that will break spectacularly is PCI passthrough of devices with memory apertures. There is a very serious bug in the NF200 PCIE bridges that causes them to bypass the IOMMU on PCIE DMA transfers. Most SAS controllers will cause a lock-up or just won't work at all with the IOMMU enabled. If you ever passthrough GPUs to your VMs you will have to use special workarounds to prevent the VM memory from overlapping the PCI IOMEM regions or the whole machine will crash as soon as the VM uses virtual memory space that overlaps with the physical IOMEM regions on the host.
 
As you can see from my signature, I managed to get this working reliable, but in hindsight, the weeks I spent figuring out the bug and working around it were worth far more than the cost difference to a good server board and top of the line Xeons to compensate for the lack of OC-ing features.


I'm really happy to come across this post. I originally planned to get a used SR-2 for GPU passthrough on ESXi...After reading the posts all, it seems to be impossible. I still have one more question...what if I assign > 4G memory for a VM, will passthrough work? Thanks.

 
Short answer:
Don't get an SR-2.
 
Long answer:
IIRC ESXi doesn't work with GPU passthrough unless the IOMMU supports ACS. NF200 bridges don't, and thus (at least recent versions of) ESXi will flat out not do it, period, regardless of any workarounds. Xen and KVM can be persuaded to ignore the lack of ACS (and the security impact of doing so).
 
Without any additional workarounds, you should be able to get PCI passthrough working with VMs allocated (4GB - PCI memory selected in BIOS). So if you set it in BIOS to 3G (max), giving a VM more than 1GB is risky if you are using PCI passthrough. If you set the PCI memory gap to 1GB you may be able to get away with giving the VMs up to 3GB. Other issues such as various lock-ups dependant on what other hardware you have may manifest regardless.
 
With recent Xen and KVM you can work around the problem by using a QEMU option to limit low memory allocation (e.g. to 1GB) to make sure the VM's memory map never includes any PCI aperture address ranges.
 
Lack of ACS still has security implications, though.
 
So with ESXi you have no hope because the hardware is to broken. With Xen or KVM you can get away with it, but make sure you understand the ramifications and limitations.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#77
roswellian2002
New Member
  • Total Posts : 2
  • Reward points : 0
  • Joined: 2016/08/27 21:57:58
  • Status: offline
  • Ribbons : 0
Re: SR-2 - 96GB Secret Sauce 2016/10/13 20:43:59 (permalink)
gordan79
roswellian2002
gordan79
One thing that will break spectacularly is PCI passthrough of devices with memory apertures. There is a very serious bug in the NF200 PCIE bridges that causes them to bypass the IOMMU on PCIE DMA transfers. Most SAS controllers will cause a lock-up or just won't work at all with the IOMMU enabled. If you ever passthrough GPUs to your VMs you will have to use special workarounds to prevent the VM memory from overlapping the PCI IOMEM regions or the whole machine will crash as soon as the VM uses virtual memory space that overlaps with the physical IOMEM regions on the host.
 
As you can see from my signature, I managed to get this working reliable, but in hindsight, the weeks I spent figuring out the bug and working around it were worth far more than the cost difference to a good server board and top of the line Xeons to compensate for the lack of OC-ing features.


I'm really happy to come across this post. I originally planned to get a used SR-2 for GPU passthrough on ESXi...After reading the posts all, it seems to be impossible. I still have one more question...what if I assign > 4G memory for a VM, will passthrough work? Thanks.

 
Short answer:
Don't get an SR-2.
 
Long answer:
IIRC ESXi doesn't work with GPU passthrough unless the IOMMU supports ACS. NF200 bridges don't, and thus (at least recent versions of) ESXi will flat out not do it, period, regardless of any workarounds. Xen and KVM can be persuaded to ignore the lack of ACS (and the security impact of doing so).
 
Without any additional workarounds, you should be able to get PCI passthrough working with VMs allocated (4GB - PCI memory selected in BIOS). So if you set it in BIOS to 3G (max), giving a VM more than 1GB is risky if you are using PCI passthrough. If you set the PCI memory gap to 1GB you may be able to get away with giving the VMs up to 3GB. Other issues such as various lock-ups dependant on what other hardware you have may manifest regardless.
 
With recent Xen and KVM you can work around the problem by using a QEMU option to limit low memory allocation (e.g. to 1GB) to make sure the VM's memory map never includes any PCI aperture address ranges.
 
Lack of ACS still has security implications, though.
 
So with ESXi you have no hope because the hardware is to broken. With Xen or KVM you can get away with it, but make sure you understand the ramifications and limitations.




Thanks a lot, gordon! That's really helpful. I'm moving to supermicro 2011 platform now.
#78
gordan79
SSC Member
  • Total Posts : 526
  • Reward points : 0
  • Joined: 2013/01/27 00:17:36
  • Status: offline
  • Ribbons : 3
Re: SR-2 - 96GB Secret Sauce 2016/10/13 23:51:32 (permalink)
As you can see from my signature, I moved from the SR-2 to a similar Supermicro board myself.

Supermicro X8DTH-6, 2x X5690
Crucial 12x 8GB x4 DR 1.35V DDR3-1600 ECC RDIMMs (96GB)
2x GTX 780Ti, 1x GTX 980Ti
Triple-Seat Virtualized With VGA Passthrough (KVM)
#79
Page: < 123 Showing page 3 of 3
Jump to:
  • Back to Mobile