[gpaw-users] [ase-users] MPI crush in my ASE script
Marcin Dulak
Marcin.Dulak at fysik.dtu.dk
Fri May 20 11:10:07 CEST 2011
Hi,
It looks like this error will be hard to diagnose without
access to your computer system.
The only thing I changed in your script was not to generate the setup,
but to use the one from the tarball instead.
It ran OK for me on 8 Opteron cores (I am sending the traj in a separate mail).
Anyway, you will probably consider some other cluster geometries,
so maybe the error won't reappear.
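In case it is useful, here is a minimal sketch (not taken from your script) of
checking up front that a pre-generated setup file is available, instead of
generating one at run time. It assumes the usual GPAW conventions that setups
are searched for in the directories listed in the GPAW_SETUP_PATH environment
variable and that the files are named like Ar.PBE.gz. The helper name
find_setup is made up for this illustration.

```python
import os


def find_setup(symbol, xc="PBE", search_path=None):
    """Return the path of the first setup file found for `symbol`, or None.

    Looks for files named like 'Ar.PBE.gz' or 'Ar.PBE' in each directory of
    `search_path`. If `search_path` is None, the GPAW_SETUP_PATH environment
    variable is used (assumed to be a list of directories separated by
    os.pathsep).
    """
    if search_path is None:
        search_path = os.environ.get("GPAW_SETUP_PATH", "")
    for directory in search_path.split(os.pathsep):
        if not directory:
            continue  # skip empty entries such as a trailing separator
        for name in (f"{symbol}.{xc}.gz", f"{symbol}.{xc}"):
            candidate = os.path.join(directory, name)
            if os.path.isfile(candidate):
                return candidate
    return None
```

Calling find_setup("Ar") at the top of the job script and stopping if it
returns None would make every run use the same setup file from the tarball,
rather than a freshly generated one that may differ slightly between machines.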
Best regards,
Marcin
Aleksander Dawid wrote:
> Hi Marcin
>
> I sent you the file you mentioned (Ar13MDnew.traj)
> I hope it helps
>
> Best
> Aleksander
>
> Quoting Marcin Dulak <Marcin.Dulak at fysik.dtu.dk>:
>
>> Hi,
>>
>> I don't have access to any installation that uses MPICH2; maybe
>> someone else does?
>> Do you get any errors from Python (the *err file), or any suspicious
>> behavior in other output files?
>> Make sure that it is not the queuing system itself that terminates
>> the job,
>> and attach /home/ccadawid/scratch-lustre/DFT/ArCluster/Ar13MDnew.traj
>> As I said, generating setups in every run is not a good idea: you
>> may get
>> slightly different setups generated on different machines, and such
>> errors tend to accumulate during long optimizer runs.
>>
>> Best regards,
>>
>> Marcin
>>
>> Aleksander Dawid wrote:
>>> Hi all ase-users
>>>
>>> I ran an MD simulation of an Ar13 cluster using the MPICH2 library. After
>>> 314 steps
>>> I obtained the following error:
>>>
>>> [cli_0]: aborting job:
>>> Fatal error in MPI_Reduce: Internal MPI error!, error stack:
>>> MPI_Reduce(850)...: MPI_Reduce(sbuf=0x44e0ea60, rbuf=(nil),
>>> count=2129920, MPI_DOUBLE, MPI_SUM, root=9, MPI_COMM_WORLD) failed
>>> MPIR_Reduce(297)..:
>>> MPIC_Sendrecv(119):
>>> (unknown)(): Internal MPI error!
>>>
>>> The Python script and the PBS script are enclosed as attachments.
>>>
>>> Please help me figure out where the problem lies: in my
>>> script or in the MPI library.
>>>
>>> With the best regards
>>> Aleksander Dawid
>>>
>>>
>>> ======================================================
>>> Aleksander Dawid
>>> University of Silesia, Division Of Computational Physics And
>>> Electronics
>>> email: aleksander.dawid at us.edu.pl
>>> ======================================================
>>>
>>>
>>> ----------------------------------------------------
>>> Uniwersytet Śląski w Katowicach http://www.us.edu.pl
>>> ------------------------------------------------------------------------
>>>
>>>
>>
>> --
>> ***********************************
>>
>> Marcin Dulak
>> Technical University of Denmark
>> Department of Physics
>> Building 307, Room 229
>> DK-2800 Kongens Lyngby
>> Denmark
>> Tel.: (+45) 4525 3157
>> Fax.: (+45) 4593 2399
>> email: Marcin.Dulak at fysik.dtu.dk
>>
>> ***********************************
>>
>>