Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/12 14:23:25
(permalink)
Spongebob28 wrote:
Did you know you can increase performance by increasing the suggested work group size in the batch file? When running the app, I noticed it would suggest 192 for my 8800 GT and 384 for my 295 GTXs. I'm limited to 192 due to the 8800 GT. If I try anything higher, it crashes the app. P.S. The above scores are unaltered. I just ran the app as is.

Yep. I thought I was doing best running the RUN_SCENE_CORNELL_64SIZE. I might have to play around with that again.
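The work group limit described in the quoted post can be sketched in a few lines of Python. This is only an illustration of the arithmetic: the function names and the 640x480 image size are made up for the example, and whether the app picks its sizes this way is an assumption. The one firm rule it relies on is that OpenCL 1.0 requires the global work size to be a multiple of the work group size.

```python
def common_work_group_size(suggested_sizes):
    """Pick a size every device accepts: the smallest suggested value."""
    if not suggested_sizes:
        raise ValueError("need at least one device")
    return min(suggested_sizes.values())


def round_up_global_size(total_items, work_group_size):
    """Round the global work size up to a multiple of the work group size."""
    groups = -(-total_items // work_group_size)  # ceiling division
    return groups * work_group_size


# Suggested sizes reported in the post above.
suggested = {"8800 GT": 192, "GTX 295": 384}
wg = common_work_group_size(suggested)

# Hypothetical 640x480 image, one work item per pixel.
print(wg, round_up_global_size(640 * 480, wg))
```

With a mixed pair like this, the smaller card sets the ceiling, which matches the crash reported when going above 192.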
post edited by Talonman - 2010/01/12 14:25:23
Asus ROG Maximus IX Hero Z270 / i7-7700K / Windows 10 Pro / EVGA GTX 1080 TI FTW3 Elite GPU / 32GB G.SKILL TridentZ RGB Series DDR4 3200MHz / EVGA Super Nova 850 G3 80 Plus Gold Modular PSU / Case: Phanteks Eclipse P400 in Red / (1) Samsung 960 EVO M.2 Internal SSD 500GB for OS (2) Samsung 850 EVO 1TB in RAID-0 for games / (1) Western Digital Black 7200 RPM 3TB Hard Drive for system backups - EVGA CLC 280 CPU Cooler. (EVGA affiliate code SKLZ84OQ2M)
|
chizow
CLASSIFIED Member
- Total Posts : 3768
- Reward points : 0
- Joined: 2007/01/27 20:15:08
- Status: offline
- Ribbons : 30
Re:New OpenCL Ray Tracing App...
2010/01/12 15:01:40
(permalink)
Talonman wrote:
I found this post interesting... http://forums.nvidia.com/index.php?showtopic=108761

'avidday' wrote:
I don't think so. Both NVIDIA and ATI are supplying special beta drivers with OpenCL support, but I don't believe either company have OpenCL support in their current release drivers. Further to that, I don't believe it is possible to build a single application that can run on either flavour of hardware. You can take OpenCL code and compile it with either vendor's SDK and it will probably work, but the back end code and support libraries are completely different and incompatible. Further to that, both flavours of OpenCL are really in beta and the performance and capabilities of both appear to be inferior to either vendor's proprietary GPU compute environments (i.e. CUDA or Stream).

It gave me the impression that one code may work on both GPUs, but one GPU vendor will lose real bad in performance. It also confirmed for me that CUDA would be faster than OpenCL.

Sorry, didn't read through the whole thread, but you are probably onto something here. More detail here from AnandTech in their OpenCL update published some time ago: http://www.anandtech.com/weblog/showpost.aspx?i=648

AnandTech wrote:
OpenCL ICD: Coming Soon
Unfortunately matters are made particularly complex by the fact that on Windows and Linux, writing an OpenCL program right now requires linking against a vendor-specific OpenCL driver. The code itself is still cross-platform/cross-device, but in terms of compiling and linking OpenCL has not been fully abstracted. It’s not yet at the point where it’s possible to write and run a single Windows/Linux program that will work with any OpenCL device. It would be the equivalent of requiring an OpenGL game (e.g. Quake) to have a different binary for each GPU vendor’s drivers. The solution to this problem is that OpenCL needs an Installable Client Driver (ICD), just like OpenGL does. With an ICD developers can link against that, and it will handle the duties of passing things off to vendor-specific drivers. However an ICD isn’t ready yet, and in fact we don’t know when it will be ready. NVIDIA - who chairs the OpenCL working group - tells us that the WG is “driving to get an ICD implementation released as quickly as possible”, but with no timetable attached to that. The effort right now appears to be on getting more OpenCL 1.0 implementations certified (NV is certified, AMD is in progress), with an ICD to follow.

As is the case more often than not, the promise of an "open standard panacea" falls short of expectations. While OpenCL does provide a common language across platforms, it still relies on vendor-specific support and an additional abstraction layer for full performance realization. If the app's author used ATI's SDK and compiler (which he must have, since he developed and debugged with a 4800 part), that most likely explains why performance is so abysmal on Nvidia parts and why the app essentially runs in software on the CPU. You will probably have to have the author, or someone else with Nvidia hardware and sufficient programming knowledge, re-write the app. If you know which headers need to be abstracted out, that might save some time. Then recompile with Nvidia's OpenCL SDK.
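The ICD idea from the AnandTech excerpt can be pictured with a toy model: the application talks to one loader, and the loader hands each call to whichever vendor driver is installed. Everything below is hypothetical (the `ToyIcdLoader` class, the vendor build functions, the returned strings); it is a sketch of the dispatch concept, not the real ICD loader or any vendor's API.

```python
class ToyIcdLoader:
    """Toy stand-in for an ICD loader: one entry point, many vendor back ends."""

    def __init__(self):
        self._drivers = {}

    def register(self, vendor, build_function):
        # A real loader would discover installed drivers; here we add them by hand.
        self._drivers[vendor] = build_function

    def platforms(self):
        return sorted(self._drivers)

    def build_program(self, vendor, source):
        # The application calls the loader; the loader forwards to the vendor driver.
        if vendor not in self._drivers:
            raise LookupError(f"no driver installed for {vendor!r}")
        return self._drivers[vendor](source)


# Hypothetical vendor back ends that "compile" kernel source.
def nvidia_build(source):
    return f"nvidia-binary({len(source)} chars)"


def amd_build(source):
    return f"amd-binary({len(source)} chars)"


loader = ToyIcdLoader()
loader.register("NVIDIA", nvidia_build)
loader.register("AMD", amd_build)

kernel_source = "__kernel void noop() {}"
for vendor in loader.platforms():
    print(vendor, loader.build_program(vendor, kernel_source))
```

The point of the design is that the application binary only ever depends on the loader, so the same binary can run against either vendor's driver.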
Intel Core i7 5930K @4.5GHz | Gigabyte X99 Gaming 5 | Win8.1 Pro x64 | Corsair H105 2x Nvidia GeForce Titan X SLI | Asus ROG Swift 144Hz 3D Vision G-Sync LCD | 2xDell U2410 | 32GB Acer XPG DDR4 2800 Samsung 850 Pro 256GB | Samsung 840EVO 4x1TB RAID 0 | Seagate 2TB SSHD Yamaha VSX-677 A/V Receiver | Polk Audio RM6880 7.1 | LG Super Multi Blu-Ray Auzen X-Fi HT HD | Logitech G710/G502/G27/G930 | Corsair Air 540 | EVGA SuperNOVA P2 1200W
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/12 15:15:38
(permalink)
That is a good thought on compiling using Nvidia's SDK. This short thread has more talk of the problems with vendors locking down their OpenCL implementations, if I understand it correctly. http://forums.nvidia.com/.dex.php?showtopic=154565
post edited by Talonman - 2010/01/12 17:43:55
|
chizow
CLASSIFIED Member
- Total Posts : 3768
- Reward points : 0
- Joined: 2007/01/27 20:15:08
- Status: offline
- Ribbons : 30
Re:New OpenCL Ray Tracing App...
2010/01/12 15:21:50
(permalink)
Check out post #4 by pcchen, that basically reiterates the AnandTech blurb on ICDs. OpenCL code still needs to link to vendor specific .dlls to tell the hardware how to run the code. I'm guessing when you compile with any specific vendor's compiler it inserts the necessary links, but when run on another vendor's hardware, those links obviously don't work, so you need an ICD to abstract out those headers and replace them with the necessary Nvidia links/headers.
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/12 15:26:46
(permalink)
Thanks again chizow. You never cease to amaze me.
|
chizow
CLASSIFIED Member
- Total Posts : 3768
- Reward points : 0
- Joined: 2007/01/27 20:15:08
- Status: offline
- Ribbons : 30
Re:New OpenCL Ray Tracing App...
2010/01/12 16:10:53
(permalink)
Np Talonman, just picked up on your lead, hopefully it helps you find a solution. Now you just need to bug Fred or someone else to re-write or re-compile the app for Nvidia hardware. Also, in that thread you linked, post #6 from iAPX is covered in that AT article too, which discusses Apple's OpenCL implementation and why it actually works for both GPU vendors and the CPU without any specific ICD:

AnandTech wrote:
Meanwhile Apple, in the traditional Apple manner, has simply done a runaround on the whole issue. When it comes to drivers they shipped Snow Leopard with their own OpenCL CPU driver, and they have GPU drivers for both AMD and NVIDIA cards. Their OpenCL framework doesn’t have an ICD per se, but it has features that allow developers to query for devices and use any they like. It effectively accomplishes the same thing, but it’s only of use when writing programs against Apple’s framework. But to Apple’s credit, as of this moment they currently have the only complete OpenCL platform, offering CPU+GPU development and execution with a full degree of abstraction.
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/12 16:13:10
(permalink)
|
luv2increase
CLASSIFIED Member
- Total Posts : 2643
- Reward points : 0
- Joined: 2008/12/31 16:26:56
- Status: offline
- Ribbons : 8
Re:New OpenCL Ray Tracing App...
2010/01/12 16:16:46
(permalink)
pcchen wrote: To my understanding, there shouldn't be a difference (at least on Windows) between using AMD's or NVIDIA's SDK. Basically the cl.h is almost the same (they differ in only one line, which is a comment). The opencl.lib files are different, but they both link to opencl.dll, with exactly the same functions and calling conventions. So there shouldn't be any benefit from recompiling with a different SDK. I think that is why we are seeing the same low performance for OpenCL on both the DirectCompute benchmark and SmallptGpu.
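pcchen's point about both SDKs resolving to the same library entry points can be checked without any SDK at all, by loading the OpenCL library at run time and calling one of its standard functions. The sketch below uses Python's ctypes and the standard `clGetPlatformIDs` call. It assumes a runtime is installed under the library name `OpenCL`; the helper name `count_opencl_platforms` is made up for the example, and it returns `None` when no usable runtime is found.

```python
import ctypes
import ctypes.util


def count_opencl_platforms():
    """Return the number of OpenCL platforms, or None if no runtime is usable."""
    name = ctypes.util.find_library("OpenCL")
    if name is None:
        return None
    # OpenCL entry points use stdcall on 32-bit Windows, so prefer WinDLL there.
    loader = getattr(ctypes, "WinDLL", ctypes.CDLL)
    try:
        lib = loader(name)
        get_ids = lib.clGetPlatformIDs
    except (OSError, AttributeError):
        return None
    # cl_int clGetPlatformIDs(cl_uint num_entries,
    #                         cl_platform_id *platforms,
    #                         cl_uint *num_platforms)
    get_ids.argtypes = [ctypes.c_uint32, ctypes.c_void_p,
                        ctypes.POINTER(ctypes.c_uint32)]
    get_ids.restype = ctypes.c_int32
    count = ctypes.c_uint32(0)
    # Passing 0 entries and a NULL list asks only for the platform count.
    status = get_ids(0, None, ctypes.byref(count))
    if status != 0:  # CL_SUCCESS is 0
        return None
    return count.value


print(count_opencl_platforms())
```

Because the function is looked up by name in whatever library is installed, the same script runs unchanged against either vendor's runtime.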
HEATWARE - Intel Core i7 920 @ 4.1Ghz 24/7 * Have x5650 Xeon 6c/12t want to install!!! - Corsair Dominator 12GB - EVGA x58 Classified 760 - MSI GTX 960 - MegaRAID 9260-8i Raid Card - 4 x Samsung 850 EVO 120GB in Raid-0 - 4 x Samsung EcoGreen 1.5TB - Thermaltake Toughpower 1200W - IKONIK Ra X10 SIM - Pioneer BD-RW - 46" Samsung LN46A630 1080p - Windows 10 Professional Build 10147
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/01/12 19:36:09
(permalink)
So, I've been following this thread for a while now. Interesting stuff IMO. Today I was downloading some updates from the Luxrender website and I saw this: Luxrender and Open CL I didn't realize that one of the reasons for this Open CL project is an Open CL powered version of Luxrender! Very cool stuff with a lot of potential! Keep up the good work guys!
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/12 21:14:30
(permalink)
Thanks for the link Nahte27...

"This is about 45 times faster than SmallptCPU running on an Intel Q6600. Talonman has achieved some really high number of samples (on a very simple scene however) by using one of the 2 GPUs available with a NVIDIA GTX 295. 114,000,000 samples/sec is really a high number."

I might get some street cred as a BETA Tester! I'm thrilled, thanks Luxrender....
post edited by Talonman - 2010/01/12 21:22:00
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/01/13 16:03:48
(permalink)
Talonman wrote:
"This is about 45 times faster than SmallptCPU running on an Intel Q6600. Talonman has achieved some really high number of samples (on a very simple scene however) by using one of the 2 GPUs available with a NVIDIA GTX 295."
I might get some street cred as a BETA Tester!

Wow, I read through the whole thing, and I can't believe I missed your name in there! That's pretty cool! If you guys can really help to get Luxrender to work on GPUs, you'll all be heroes in my book!
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/01/13 21:14:37
(permalink)
Thanks, I'm just glad we may have helped to speed up its development.
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/02/11 13:56:16
(permalink)
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/02/11 16:17:58
(permalink)
Do you see a link to download that version? I can't seem to find it. It looks good.
|
chumbucket843
iCX Member
- Total Posts : 469
- Reward points : 0
- Joined: 2009/04/15 19:21:56
- Status: offline
- Ribbons : 0
Re:New OpenCL Ray Tracing App...
2010/02/11 16:31:40
(permalink)
small luxgpu is already in beta and i have not tried this yet but you might be able to export a .py from blender or some other program and run it on this. it comes with some batch files with existing scenes. my gtx 260 is 4x faster than my core i7. http://forum.beyond3d.com/showpost.php?p=1387162&postcount=240
Core i7 D0 EVGA X58 LE EVGA GTX260\\folding 3x2 GB DDR3 *10 real cores folding
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/02/11 17:03:01
(permalink)
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/02/11 21:34:14
(permalink)
chumbucket843 wrote:
small luxgpu is already in beta and i have not tried this yet but you might be able to export a .py from blender or some other program and run it on this. it comes with some batch files with existing scenes. my gtx 260 is 4x faster than my core i7. http://forum.beyond3d.com/showpost.php?p=1387162&postcount=240

I think .py is a Python script. Luxrender uses .lxs... Do you know if luxgpu supports .lxs? I'd love to try some of my own scenes and see how they do...
|
Talonman
FTW Member
- Total Posts : 1391
- Reward points : 0
- Joined: 2008/04/01 09:26:53
- Location: Ohio
- Status: offline
- Ribbons : 31
Re:New OpenCL Ray Tracing App...
2010/02/13 06:04:56
(permalink)
Nahte27 wrote:
So, I've been following this thread for a while now. Interesting stuff IMO. Today I was downloading some updates from the Luxrender website and I saw this: Luxrender and Open CL I didn't realize that one of the reasons for this Open CL project is an Open CL powered version of Luxrender! Very cool stuff with a lot of potential! Keep up the good work guys!

Dave has a message for you. http://forum.beyond3d.com...mp;t=55913&page=12

"Ah, thanks, could you write there to Nathe72 that the ".py" file is a ".ply" and there is a SmallLuxGPU exporter for Blender 2.5 available here: http://www.luxrender.net/forum/viewt...hp?f=34&t=3420 LuxrenderGPU uses instead the same .lxs file of Luxrender".
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/02/13 09:09:50
(permalink)
|
chumbucket843
iCX Member
- Total Posts : 469
- Reward points : 0
- Joined: 2009/04/15 19:21:56
- Status: offline
- Ribbons : 0
Re:New OpenCL Ray Tracing App...
2010/02/13 10:30:47
(permalink)
well, hook us up with some cool renders plz.
|
Nahte27
FTW Member
- Total Posts : 1582
- Reward points : 0
- Joined: 2009/03/20 14:13:46
- Location: Oklahoma
- Status: offline
- Ribbons : 9
Re:New OpenCL Ray Tracing App...
2010/02/13 13:18:14
(permalink)
Alright, it was a bit of an ordeal but I finished a render with SmallLuxGPU and wow is that fast! It certainly has potential! I had to move to Blender 2.5 to get the exporter to work, so there was a bit of a learning curve for me. Not to mention most of my scenes were incompatible for one reason or another. I had to heavily modify a scene I'm currently working on. There are no complex materials, and the camera view was off, so you can see parts you're not supposed to, but with only an hour of rendering, I'd say it turned out pretty good! And I was surprised to see it was using my 275 AND 250! How awesome is that!
|