the_Scarlet_one
formerly Scarlet-tech
- Total Posts : 24581
- Reward points : 0
- Joined: 2013/11/13 02:48:57
- Location: East Coast
- Status: offline
- Ribbons : 79
Re: Geforce Drivers 4xx.xx Drop more than 2/3 in CUDA Performance from the 3xx.xx Drivers.
2018/11/28 08:06:05
Searching for the error on Google, it used to come up quite often, but it doesn't seem to come up as often now.
Can you give me some settings to pass on to them? Unfortunately I am only the middle man, so I can only give them what I am told to pass on.
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 08:07:44
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 12:50:26
|
the_Scarlet_one
formerly Scarlet-tech
- Total Posts : 24581
- Reward points : 0
- Joined: 2013/11/13 02:48:57
- Location: East Coast
- Status: offline
- Ribbons : 79
Re: Geforce Drivers 4xx.xx Drop more than 2/3 in CUDA Performance from the 3xx.xx Drivers.
2018/11/28 13:20:19
Did 417.01 correct all of the issues you were having?
All of your numbers above are very close together.
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 13:22:50
the_Scarlet_one: "Did 417.01 correct all of the issues you were having? All of your numbers above are very close together."
No, not on the AP app from PG. I am running all the other BOINC projects to see what they are looking like.
post edited by bcavnaugh - 2018/11/28 19:01:09
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 13:33:57
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 13:51:34
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 14:34:06
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 15:10:57
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 15:31:12
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 16:04:11
|
the_Scarlet_one
formerly Scarlet-tech
- Total Posts : 24581
- Reward points : 0
- Joined: 2013/11/13 02:48:57
- Location: East Coast
- Status: offline
- Ribbons : 79
Re: Geforce Drivers 4xx.xx Drop more than 2/3 in CUDA Performance from the 3xx.xx Drivers.
2018/11/28 18:24:53
Wonderful news! NVIDIA has reproduced the issue in house, and they are investigating now. The post on NVIDIA, by Robert Crovella:
A widespread performance degradation should have been caught by our QA processes, so if this issue is mostly localized to a single app, that isn't necessarily surprising.
Also, contrary to this thread title, the development team has categorized this as an issue with OpenCL, not CUDA. I assume this means, in spite of the title of the executable, that the underlying (GPU) code is written in OpenCL, not CUDA, however I have not attempted to confirm this myself. The distinction isn't terribly important with respect to issue resolution. Merely a point of clarification for others who might read this thread and wonder if it applies to them.
The issue has been reproduced internally at NVIDIA, based on information provided so far via the bug report. There is a (fairly standard) plan in place to attempt to identify underlying root cause. I don't have any further information to share at this time, and won't be able to respond to requests for more information, most other questions, or any sort of inquiry about what the current status or state of the issue is, until there is sufficient forward progress on the analysis of the bug. At that time, I will do my best to be proactive and provide an update here. Until then, I'm unlikely to respond to requests for more information.
I don't expect the issue to be sorted out rapidly. Working with a compiled binary (as opposed to having source code and active participation from the developer) generally results in slower progress of issue resolution (using the "standard" plan I referred to). If that plan doesn't yield useful info, progress can be even slower.
And of course, like all issues, resolution of this issue is subject to assessed priority as well as competing priorities in a resource-constrained environment.
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/11/28 18:41:17
Title updated to "Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers". And what ^ said above.
post edited by bcavnaugh - 2018/11/28 18:59:49
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2018/12/12 09:08:38
Topic: GeForce Drivers 4xx.xx drop more than 2/3 in OpenCL Performance from the 3xx.xx Drivers
There is no point in testing newer drivers; I don't expect any changes in this respect. Changes are required in the application if they want to restore performance with the newer drivers.
Current Scenario in ap26 app:
1. App queries CL_KERNEL_WORK_GROUP_SIZE in order to decide local work group size of either 1024 (seems optimal) or 64 (sub-optimal). If app gets value for query <1024 it reduces local work group size to 64 assuming device doesn't support 1024.
2. Nvidia OpenCL Driver changed return value for CL_KERNEL_WORK_GROUP_SIZE from 1024 to 256.
3. App is not using CL_KERNEL_WORK_GROUP_SIZE returned by driver as is, but just choosing a non-optimal local work-group size (64) based on this query.
What should developers do:
• Query CL_KERNEL_WORK_GROUP_SIZE to get just hint about work group size from driver and use it to launch kernel with that specific value. It need not be optimal for all kernels.
• App is free to choose any value from range [1 , CL_DEVICE_MAX_WORK_GROUP_SIZE] to get best possible work group size for different kernels, irrespective of CL_KERNEL_WORK_GROUP_SIZE returned by driver.
Suggestions specific to ap26:
• App can query CL_DEVICE_MAX_WORK_GROUP_SIZE and set work group size accordingly instead of using CL_KERNEL_WORK_GROUP_SIZE.
• Simplest solution for ap26 would be to use 1024 work group size directly if it comes in range [1 , CL_DEVICE_MAX_WORK_GROUP_SIZE].
I don't know how to best communicate the above information to the developers. If there is a good way to do that, please advise.
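For anyone passing this on to the developers, here is a rough sketch in C of the two selection rules described above. It is not the ap26 source (which nobody in this thread has seen); the function names `ap26_current_choice` and `pick_local_size` are made up for illustration. In a real host program the inputs would presumably come from `clGetKernelWorkGroupInfo` (for `CL_KERNEL_WORK_GROUP_SIZE`) and `clGetDeviceInfo` (for `CL_DEVICE_MAX_WORK_GROUP_SIZE`); here they are plain parameters so the logic can be read and compiled without an OpenCL install.

```c
#include <assert.h>
#include <stddef.h>

/* Behaviour attributed to ap26 in the post above (an assumption, not the
 * real source): any CL_KERNEL_WORK_GROUP_SIZE below 1024 falls back to 64. */
size_t ap26_current_choice(size_t kernel_wg_size)
{
    return kernel_wg_size < 1024 ? 64 : 1024;
}

/* Suggested alternative: keep the preferred size (1024 for ap26) whenever it
 * lies in [1, CL_DEVICE_MAX_WORK_GROUP_SIZE], otherwise clamp to the device
 * maximum. The kernel-level hint is deliberately not consulted here. */
size_t pick_local_size(size_t preferred, size_t device_max_wg_size)
{
    assert(preferred >= 1 && device_max_wg_size >= 1);
    return preferred <= device_max_wg_size ? preferred : device_max_wg_size;
}
```

With the newer drivers reporting 256 for the kernel query, the first function lands on 64, while the second still picks 1024 on a device whose maximum is 1024. It would still be sensible to check the return code of the kernel launch, since a driver may reject a local size it cannot run.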
post edited by bcavnaugh - 2018/12/12 09:10:29
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2019/01/15 10:58:20
Game Ready Driver Release Notes (v417.71)
Experimental OpenCL 2.0 Features
Select features in OpenCL 2.0 are available in the driver for evaluation purposes only. The following are the features as well as a description of known issues with these features in the driver:
Device side enqueue
• The current implementation is limited to 64-bit platforms only.
• OpenCL 2.0 allows kernels to be enqueued with global_work_size larger than the compute capability of the NVIDIA GPU. The current implementation supports only combinations of global_work_size and local_work_size that are within the compute capability of the NVIDIA GPU. For a given grid dimension, the global_work_size can be determined by CUDA grid size x CUDA block size.
• For executing kernels (whether from the host or the device), OpenCL 2.0 supports non-uniform ND-ranges where global_work_size does not need to be divisible by the local_work_size. This capability is not yet supported in the NVIDIA driver, and therefore not supported for device side kernel enqueues.
Shared virtual memory
• The current implementation of shared virtual memory is limited to 64-bit platforms only.
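Since the notes say non-uniform ND-ranges are not supported, a common workaround is to round global_work_size up to the next multiple of local_work_size on the host and have the kernel skip the extra work-items. A minimal sketch in C, assuming a one-dimensional range; the helper name `round_up_global_size` is hypothetical and not part of any NVIDIA or OpenCL API:

```c
#include <assert.h>
#include <stddef.h>

/* Round the number of work-items up to a whole number of work-groups, so
 * that global_work_size is divisible by local_work_size. This mirrors the
 * "grid size x block size" relation in the notes: groups * local_size. */
size_t round_up_global_size(size_t work_items, size_t local_size)
{
    assert(local_size > 0);
    size_t groups = (work_items + local_size - 1) / local_size;
    return groups * local_size;
}
```

For example, 1000 work-items with a local size of 256 become 4 groups, for a global size of 1024. The kernel then needs a guard along the lines of `if (get_global_id(0) >= n) return;` so the 24 padding work-items do nothing.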
|
Cool GTX
EVGA Forum Moderator
- Total Posts : 31005
- Reward points : 0
- Joined: 2010/12/12 14:22:25
- Location: Folding for the Greater Good
- Status: offline
- Ribbons : 122
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2019/01/16 13:21:59
OK, but can you say that 3 times really fast ...
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2019/01/16 13:54:00
Still Testing this New BR Driver.
|
bcavnaugh
The Crunchinator
- Total Posts : 38977
- Reward points : 0
- Joined: 2012/09/18 17:31:18
- Location: USA Affiliate E5L3CTGE12 Associate 9E88QK5L7811G3H
- Status: offline
- Ribbons : 282
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2019/01/18 12:27:38
Even with this BR Driver it is still undermining the GTX 1080 Cards.
|
Cool GTX
EVGA Forum Moderator
- Total Posts : 31005
- Reward points : 0
- Joined: 2010/12/12 14:22:25
- Location: Folding for the Greater Good
- Status: offline
- Ribbons : 122
Re: Geforce Drivers 4xx.xx Drop more than 2/3 OpenCL Performance from the 3xx.xx Drivers
2019/01/18 16:23:12
Good to know. Thanks for the update and the continued in-depth testing.
|