[gpaw-users] Segmentation fault with scalapack diagonalization
Carrie Farberow
farberow at wisc.edu
Thu Jul 8 00:45:09 CEST 2010
Dear gpaw-users,
I found that this was due to linking to the wrong blacs library. I was incorrectly using libmkl_blacs_lp64.a, rather than libmkl_blacs_openmpi_lp64.a.
Thanks,
Carrie
----- Original Message -----
From: Carrie Farberow <farberow at wisc.edu>
Date: Wednesday, July 7, 2010 12:50 am
Subject: [gpaw-users] Segmentation fault with scalapack diagonalization
To: gpaw users <gpaw-users at listserv.fysik.dtu.dk>
> Dear gpaw-users,
>
> When running the scalapack diagonalization test on the GPAW
> installation webpage:
>
> mpirun -np 2 gpaw-python ~/gpaw/test/CH4.py --sl_diagonalize=1,2,2
>
> The run crashes and I get the following segmentation fault:
>
> [eu095:06249] Signal: Segmentation fault (11)
> [eu095:06249] Signal code: Address not mapped (1)
> [eu095:06249] Failing at address: 0xf3
> [eu095:06249] [ 0] /lib64/libpthread.so.0 [0x3e3420e7c0]
> [eu095:06249] [ 1]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(MPI_Comm_size+0x52)
> [0xc530c2]
> [eu095:06249] [ 2]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(blacs_pinfo_+0x9d)
> [0xc24481]
> [eu095:06249] [ 3]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(sl_init_+0x22)
> [0xa10032]
> [eu095:06249] [ 4]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python
> [0x45fb2a]
> [eu095:06249] [ 5]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(PyCFunction_Call+0x110)
> [0x4dd0f0]
> ...
>
> Without the --sl_diagonalize=1,2,2 the CH4.py script produces the
> correct results.
>
> In addition and possibly related, when running the testsuite the
> following segmentation fault prints:
>
> [eu095:30534] Signal: Segmentation fault (11)
> [eu095:30534] Signal code: (128)
> [eu095:30534] Failing at address: (nil)
> [eu095:30534] [ 0] /lib64/libpthread.so.0 [0x3e3420e7c0]
> [eu095:30534] [ 1]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(mkl_lapack_zladiv+0x25)
> [0x910add]
> [eu095:30534] [ 2]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(ZLADIV+0x6)
> [0xadc85a]
> [eu095:30534] [ 3] /usr/lib64/atlas/liblapack.so.3(zlahqr_+0x1224) [0x2aaab6c69bf4]
> [eu095:30534] [ 4] /usr/lib64/atlas/liblapack.so.3(zhseqr_+0x8d3) [0x2aaab6c5b9a3]
> [eu095:30534] [ 5] /usr/lib64/atlas/liblapack.so.3(zgeev_+0x695) [0x2aaab6bfc0a5]
> [eu095:30534] [ 6]
> /usr/lib64/python2.4/site-packages/numpy/linalg/lapack_lite.so [0x2aaab66f36c8]
> [eu095:30534] [ 7]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python(PyCFunction_Call+0x110)
> [0x4dd0f0]
> [eu095:30534] [ 8]
> /home/cfarberow/opt/gpaw-0.6.5147/build/bin.linux-x86_64-2.4/gpaw-python
> [0x54c7b0]
> ...
>
> Simple scripts for slab calculations don't throw any errors, but I'd
> like to do some scaling tests for large clusters and its my
> understanding that this is best with scalapack enabled. I've tried
> multiple releases of GPAW and continue to get the same errors. I'm
> currently working with gpaw-0.6.5147.tar.gz. I'm using Intel_MKL
> (which I've also tried multiple versions of - currently using
> 10.0.4.023). I'm also using Python 2.4.4, numpy-1.2.1 and openmpi-1.4.1.
>
> Any recommendations would be greatly appreciated.
>
> Best regards,
> Carrie
> _______________________________________________
> gpaw-users mailing list
> gpaw-users at listserv.fysik.dtu.dk
> https://listserv.fysik.dtu.dk/mailman/listinfo/gpaw-users
>
More information about the gpaw-users
mailing list