[gpaw-users] "Failed to orthogonalize" when domain size changed.

Marcin Dulak Marcin.Dulak at fysik.dtu.dk
Wed Apr 27 09:04:07 CEST 2011


Hi,

i'm unable to reproduce the problem on 8 cores on our cluster 
(https://wiki.fysik.dtu.dk/niflheim/Hardware), gpaw/0.7.2.6974, 
ase/3.4.1.1765.
Which of the jobs fail? Can you present the output, especially the part 
that concerns the parallelization,
for example for Bi111k-2.txt I get:
------------------------
Total number of cores used: 8
Domain Decomposition: 1 x 2 x 4
Diagonalizer layout: Serial LAPACK
Orthonormalizer layout: Serial LAPACK

Symmetries present: 2
2 k-points in the Irreducible Part of the Brillouin Zone (total: 4)
------------------------
Please also run tests in parallel (see 
https://wiki.fysik.dtu.dk/gpaw/install/installationguide.html#run-the-tests), 
assuming bash:

mpirun -np 8 gpaw-python `which gpaw-test`  2>&1 | tee test.log

Best regards,

Marcin

Chris Willmore wrote:
> Hi All,
>
> I was given a script to run on some spare hardware, which had a hard 
> coded domain of 2. I had 8 cpu's so, I modified the script to use the 
> variable gpaw.mpi.world.size. When the script runs with only 2 nodes 
> it works fine (albeit slower than desired), but when I run with 8 
> nodes, it crashes with a "Failed to orthogonalize" error. Attached is 
> the script. Any suggestions?
>
> Thanks,
> Chris
> ------------------------------------------------------------------------
>
> _______________________________________________
> gpaw-users mailing list
> gpaw-users at listserv.fysik.dtu.dk
> https://listserv.fysik.dtu.dk/mailman/listinfo/gpaw-users

-- 
***********************************
 
Marcin Dulak
Technical University of Denmark
Department of Physics
Building 307, Room 229
DK-2800 Kongens Lyngby
Denmark
Tel.: (+45) 4525 3157
Fax.: (+45) 4593 2399
email: Marcin.Dulak at fysik.dtu.dk

***********************************



More information about the gpaw-users mailing list