I am trying to run a Simple CUDA program which queries the numbers of GPUs in the nodes using Open MPI v1.6.1 (64 bit) in Windows 7 Professional 64 bit Machines. When I run the application in root node, the correct GPU count is getting. But when the same application is tried to run in different nodes remotely using Open MPI, the GPU count is getting as 0, ie, it is showing as No CUDA capable GPU is present in the node.

The root and the nodes having Compute Capability 2.1 capable GPUs. I have built the code with CUDA 4.2. When a normal MPI application is running, it is working correctly.

Is there any special settings required for running a CUDA application in Windows 7 using Open MPI v1.6.1?