Segmentation fault

Problems running VASP: crashes, internal errors, "wrong" results.


Gu Chenjie
Newbie
Posts: 18
Joined: Thu Nov 25, 2010 5:41 am

Segmentation fault

#1 Post by Gu Chenjie » Wed Dec 29, 2010 8:30 am

Dear Sir,
when I run the example from hands-on session 1, the following error happens:

Code:

 running on    1 nodes
 distr:  one band on    1 nodes,    1 groups
 vasp.5.2.8 07Jul10 complex 
 POSCAR found :  1 types and    1 ions
 LDA part: xc-table for Pade appr. of Perdew
 POSCAR, INCAR and KPOINTS ok, starting setup
 WARNING: small aliasing (wrap around) errors must be expected
 FFT: planning ...(           1 )
 reading WAVECAR
 WARNING: random wavefunctions but no delay for mixing, default for NELMDL
 entering main loop
       N       E                     dE             d eps       ncg     rms          rms(c)
Segmentation fault

I suspect this is a compilation problem, because I use Intel's Fortran compiler 11,
while for MPI I use mpif90 from MPICH2.
Is there any way to solve this problem?
Thanks a lot.
INCAR

Code:

 SYSTEM = O atom in a box
 ISMEAR = 0  ! Gaussian smearing

KPOINTS

Code:

Gamma-point only
 1        ! one k-point
rec       ! in units of the reciprocal lattice vector
 0 0 0 1  ! 3 coordinates and weight


POSCAR

Code:

O atom in a box
 1.0          ! universal scaling parameters
 8.0 0.0 0.0  ! lattice vector  a(1)
 0.0 8.0 0.0  ! lattice vector  a(2)
 0.0 0.0 8.0  ! lattice vector  a(3)
1             ! number of atoms
cart          ! positions in cartesian coordinates
 0 0 0


and finally the makefile

Code:

.SUFFIXES: .inc .f .f90 .F
#-----------------------------------------------------------------------
# Makefile for Intel Fortran compiler for Pentium/Athlon/Opteron 
# based systems
# we recommend this makefile for both Intel as well as AMD systems
# for AMD based systems appropriate BLAS and fftw libraries are
# however mandatory (whereas they are optional for Intel platforms)
#
# The makefile was tested only under Linux on Intel and AMD platforms
# the following compiler versions have been tested:
#  - ifc.7.1  works stably; somewhat slow but reliable
#  - ifc.8.1  fails to compile the code properly
#  - ifc.9.1  recommended (both for 32 and 64 bit)
#  - ifc.10.1 partially recommended (both for 32 and 64 bit)
#             tested build 20080312 Package ID: l_fc_p_10.1.015
#             the gamma-only mpi version cannot be compiled
#             using ifc.10.1
#
# it might be required to change some of the library paths, since
# Linux installations vary a lot
# Hence check ***ALL*** options in this makefile very carefully
#-----------------------------------------------------------------------
#
# BLAS must be installed on the machine
# there are several options:
# 1) very slow but works:
#   retrieve the LAPACK package from ftp.netlib.org
#   and compile the blas routines (BLAS/SRC directory)
#   please use g77 or f77 for the compilation. When I tried to
#   use pgf77 or pgf90 for BLAS, VASP hung up when calling
#   ZHEEV  (however, this was with lapack 1.1; now I use lapack 2.0)
# 2) more desirable: get an optimized BLAS 
#
# the two most reliable packages around are presently:
# 2a) Intel's own optimised BLAS (PIII, P4, PD, PC2, Itanium)
#     http://developer.intel.com/software/products/mkl/
#   this is really excellent, if you use Intel CPU's
#
# 2b) probably fastest SSE2 (4 GFlops on P4, 2.53 GHz, 16 GFlops PD, 
#     around 30 GFlops on Quad core)
#   Kazushige Goto's BLAS
#   http://www.cs.utexas.edu/users/kgoto/signup_first.html
#   http://www.tacc.utexas.edu/resources/software/
#
#-----------------------------------------------------------------------

# all CPP processed fortran files have the extension .f90
SUFFIX=.f90

#-----------------------------------------------------------------------
# fortran compiler and linker
#-----------------------------------------------------------------------
FC=ifort 
# fortran linker
FCL=$(FC)


#-----------------------------------------------------------------------
# whereis CPP ?? (I need CPP, can't use gcc with proper options)
# that's the location of gcc for SUSE 5.3
#
#  CPP_   =  /usr/lib/gcc-lib/i486-linux/2.7.2/cpp -P -C 
#
# that's probably the right line for some Red Hat distribution:
#
#  CPP_   =  /usr/lib/gcc-lib/i386-redhat-linux/2.7.2.3/cpp -P -C
#
#  SUSE X.X, maybe some Red Hat distributions:

CPP_ =  ./preprocess <$*.F | /usr/bin/cpp -P -C -traditional >$*$(SUFFIX)

#-----------------------------------------------------------------------
# possible options for CPP:
# NGXhalf             charge density   reduced in X direction
# wNGXhalf            gamma point only reduced in X direction
# avoidalloc          avoid ALLOCATE if possible
# PGF90               work around some PGF90 / IFC bugs
# CACHE_SIZE          1000 for PII,PIII, 5000 for Athlon, 8000-12000 P4, PD
# RPROMU_DGEMV        use DGEMV instead of DGEMM in RPRO (depends on used BLAS)
# RACCMU_DGEMV        use DGEMV instead of DGEMM in RACC (depends on used BLAS)
#-----------------------------------------------------------------------

CPP     = $(CPP_)  -DHOST=\"LinuxIFC\" \
          -Dkind8 -DCACHE_SIZE=12000 -DPGF90 -Davoidalloc -DNGXhalf \
#          -DRPROMU_DGEMV  -DRACCMU_DGEMV

#-----------------------------------------------------------------------
# general fortran flags  (there must be a trailing blank on this line)
# byterecl is strictly required for ifc, since otherwise
# the WAVECAR file becomes huge
#-----------------------------------------------------------------------

FFLAGS = -I/home/enola/lib/intel/icc/mkl/include/fftw -FR -lowercase 

#-----------------------------------------------------------------------
# optimization
# we have tested whether higher optimisation improves performance
# -axK  SSE1 optimization,  but also generate code executable on all mach.
#       xK improves performance somewhat on XP, and a is required in order
#       to run the code on older Athlons as well
# -xW   SSE2 optimization
# -axW  SSE2 optimization,  but also generate code executable on all mach.
# -tpp6 P3 optimization
# -tpp7 P4 optimization
#-----------------------------------------------------------------------

# ifc.9.1, ifc.10.1 recommended
OFLAG=-O3

OFLAG_HIGH = $(OFLAG)
OBJ_HIGH = 
OBJ_NOOPT = 
DEBUG  = -FR -O0
INLINE = $(OFLAG)

#-----------------------------------------------------------------------
# the following lines specify the position of BLAS  and LAPACK
# VASP works fastest with the libgoto library
# so that's what we recommend
#-----------------------------------------------------------------------

# mkl.10.0
# set -DRPROMU_DGEMV  -DRACCMU_DGEMV in the CPP lines
#BLAS=-L/opt/intel/mkl100/lib/em64t -lmkl -lpthread

# even faster for VASP Kazushige Goto's BLAS
# http://www.cs.utexas.edu/users/kgoto/signup_first.html
# parallel goto version sometimes requires -libverbs
#BLAS=  /opt/libs/libgoto/libgoto.so
BLAS=-L/home/enola/lib/intel/icc/mkl/lib/intel64 -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread
# LAPACK, simplest use vasp.5.lib/lapack_double
LAPACK= -L/home/enola/lib/intel/icc/mkl/lib/intel64 -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread

# use the mkl Intel lapack
#LAPACK= -lmkl_lapack

#-----------------------------------------------------------------------

LIB  = -L../vasp.5.lib -ldmy \
     ../vasp.5.lib/linpack_double.o $(LAPACK) \
     $(BLAS)

# options for linking, nothing is required (usually)
LINK    = 

#-----------------------------------------------------------------------
# fft libraries:
# VASP.5.2 can use fftw.3.1.X (http://www.fftw.org)
# since this version is faster on P4 machines, we recommend using it
#-----------------------------------------------------------------------

#FFT3D   = fft3dfurth.o fft3dlib.o
FFT3D   =  fftmpiw.o fftmpi_map.o fftw3d.o fft3dlib.o  /home/enola/lib/intel/icc/mkl/lib/intel64/libfftw3xf_intel.a 
# alternatively: fftw.3.1.X is slightly faster and should be used if available
#FFT3D   = fftw3d.o fft3dlib.o   /opt/libs/fftw-3.1.2/lib/libfftw3.a


#=======================================================================
# MPI section, uncomment the following lines until 
#    general  rules and compile lines
# presently we recommend OPENMPI, since it seems to offer better
# performance than lam or mpich
# 
# !!! Please do not send me any queries on how to install MPI, I will
# certainly not answer them !!!!
#=======================================================================
#-----------------------------------------------------------------------
# fortran linker for mpi
#-----------------------------------------------------------------------

FC=mpif90
FCL=$(FC)

#-----------------------------------------------------------------------
# additional options for CPP in parallel version (see also above):
# NGZhalf               charge density   reduced in Z direction
# wNGZhalf              gamma point only reduced in Z direction
# scaLAPACK             use scaLAPACK (usually slower on 100 Mbit Net)
#-----------------------------------------------------------------------

CPP    = $(CPP_) -DMPI  -DHOST=\"LinuxIFC\" -DIFC \
     -Dkind8 -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc -DNGZhalf \
     -DMPI_BLOCK=8000 \
    -DRPROMU_DGEMV  -DRACCMU_DGEMV

#-----------------------------------------------------------------------
# location of SCALAPACK
# if you do not use SCALAPACK simply leave that section commented out
#-----------------------------------------------------------------------

#BLACS=$(HOME)/archives/SCALAPACK/BLACS/
#SCA_=$(HOME)/archives/SCALAPACK/SCALAPACK

#SCA= $(SCA_)/libscalapack.a  \
# $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a $(BLACS)/LIB/blacs_MPI-LINUX-0.a $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a

SCA=

#-----------------------------------------------------------------------
# libraries for mpi
#-----------------------------------------------------------------------

#LIB     = -L../vasp.5.lib -ldmy  \
#      ../vasp.5.lib/linpack_double.o $(LAPACK) \
#      $(SCA) $(BLAS)

# FFT: fftmpi.o with fft3dlib of Juergen Furthmueller
#FFT3D   = fftmpi.o fftmpi_map.o fft3dfurth.o fft3dlib.o 

# alternatively: fftw.3.1.X is slightly faster and should be used if available
#FFT3D   = fftmpiw.o fftmpi_map.o fftw3d.o fft3dlib.o  /opt/libs/fftw-3.1.2/lib/libfftw3.a

#-----------------------------------------------------------------------
# general rules and compile lines
#-----------------------------------------------------------------------
BASIC=   symmetry.o symlib.o   lattlib.o  random.o   

SOURCE=  base.o     mpi.o      smart_allocate.o      xml.o  \
         constant.o jacobi.o   main_mpi.o  scala.o   \
         asa.o      lattice.o  poscar.o   ini.o       xclib.o     xclib_grad.o \
         radial.o   pseudo.o   mgrid.o    gridq.o     ebs.o  \
         mkpoints.o wave.o     wave_mpi.o  wave_high.o  \
         $(BASIC)   nonl.o     nonlr.o    nonl_high.o dfast.o    choleski2.o \
         mix.o      hamil.o    xcgrad.o   xcspin.o    potex1.o   potex2.o  \
         constrmag.o cl_shift.o relativistic.o LDApU.o \
         paw_base.o metagga.o  egrad.o    pawsym.o   pawfock.o  pawlhf.o    rhfatm.o  paw.o   \
         mkpoints_full.o       charge.o   dipol.o    pot.o  \
         dos.o      elf.o      tet.o      tetweight.o hamil_rot.o \
         steep.o    chain.o    dyna.o     sphpro.o    us.o  core_rel.o \
         aedens.o   wavpre.o   wavpre_noio.o broyden.o \
         dynbr.o    rmm-diis.o reader.o   writer.o   tutor.o xml_writer.o \
         brent.o    stufak.o   fileio.o   opergrid.o stepver.o  \
         chgloc.o   fast_aug.o fock.o     mkpoints_change.o sym_grad.o \
         mymath.o   internals.o dimer_heyden.o dvvtrajectory.o vdwforcefield.o \
         hamil_high.o nmr.o    force.o \
         pead.o     subrot.o   subrot_scf.o pwlhf.o  gw_model.o optreal.o   davidson.o \
         electron.o rot.o  electron_all.o shm.o    pardens.o  paircorrection.o \
         optics.o   constr_cell_relax.o   stm.o    finite_diff.o elpol.o    \
         hamil_lr.o rmm-diis_lr.o  subrot_cluster.o subrot_lr.o \
         lr_helper.o hamil_lrf.o   elinear_response.o ilinear_response.o \
         linear_optics.o linear_response.o   \
         setlocalpp.o  wannier.o electron_OEP.o electron_lhf.o twoelectron4o.o \
         ratpol.o screened_2e.o wave_cacher.o chi_base.o wpot.o local_field.o \
         ump2.o bse.o acfdt.o chi.o sydmat.o 

INC=

vasp: $(SOURCE) $(FFT3D) $(INC) main.o 
	rm -f vasp
	$(FCL) -o vasp main.o  $(SOURCE)   $(FFT3D) $(LIB) $(LINK)
makeparam: $(SOURCE) $(FFT3D) makeparam.o main.F $(INC)
	$(FCL) -o makeparam  $(LINK) makeparam.o $(SOURCE) $(FFT3D) $(LIB)
zgemmtest: zgemmtest.o base.o random.o $(INC)
	$(FCL) -o zgemmtest $(LINK) zgemmtest.o random.o base.o $(LIB)
dgemmtest: dgemmtest.o base.o random.o $(INC)
	$(FCL) -o dgemmtest $(LINK) dgemmtest.o random.o base.o $(LIB) 
ffttest: base.o smart_allocate.o mpi.o mgrid.o random.o ffttest.o $(FFT3D) $(INC)
	$(FCL) -o ffttest $(LINK) ffttest.o mpi.o mgrid.o random.o smart_allocate.o base.o $(FFT3D) $(LIB)
kpoints: $(SOURCE) $(FFT3D) makekpoints.o main.F $(INC)
	$(FCL) -o kpoints $(LINK) makekpoints.o $(SOURCE) $(FFT3D) $(LIB)

clean:	
	-rm -f *.g *.f *.o *.L *.mod ; touch *.F

main.o: main$(SUFFIX)
	$(FC) $(FFLAGS)$(DEBUG)  $(INCS) -c main$(SUFFIX)
xcgrad.o: xcgrad$(SUFFIX)
	$(FC) $(FFLAGS) $(INLINE)  $(INCS) -c xcgrad$(SUFFIX)
xcspin.o: xcspin$(SUFFIX)
	$(FC) $(FFLAGS) $(INLINE)  $(INCS) -c xcspin$(SUFFIX)

makeparam.o: makeparam$(SUFFIX)
	$(FC) $(FFLAGS)$(DEBUG)  $(INCS) -c makeparam$(SUFFIX)

makeparam$(SUFFIX): makeparam.F main.F 
#
# MIND: I do not have a full dependency list for the include
# and MODULES: here are only the minimal basic dependencies
# if one structure is changed then touch_dep must be called
# with the corresponding name of the structure
#
base.o: base.inc base.F
mgrid.o: mgrid.inc mgrid.F
constant.o: constant.inc constant.F
lattice.o: lattice.inc lattice.F
setex.o: setexm.inc setex.F
pseudo.o: pseudo.inc pseudo.F
poscar.o: poscar.inc poscar.F
mkpoints.o: mkpoints.inc mkpoints.F
wave.o: wave.F
nonl.o: nonl.inc nonl.F
nonlr.o: nonlr.inc nonlr.F

$(OBJ_HIGH):
	$(CPP)
	$(FC) $(FFLAGS) $(OFLAG_HIGH) $(INCS) -c $*$(SUFFIX)
$(OBJ_NOOPT):
	$(CPP)
	$(FC) $(FFLAGS) $(INCS) -c $*$(SUFFIX)

fft3dlib_f77.o: fft3dlib_f77.F
	$(CPP)
	$(F77) $(FFLAGS_F77) -c $*$(SUFFIX)

.F.o:
	$(CPP)
	$(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX)
.F$(SUFFIX):
	$(CPP)
$(SUFFIX).o:
	$(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX)

# special rules
#-----------------------------------------------------------------------
# these special rules are cumulative (that is, once a file has failed
#   with one compiler version, it stays in the list forever)
# -tpp5|6|7 P, PII-PIII, PIV
# -xW use SIMD (does not pay off on PII, since fft3d uses double prec)
# all other options do not affect the code performance since -O1 is used

fft3dlib.o : fft3dlib.F
	$(CPP)
	$(FC) -FR -lowercase -O2 -c $*$(SUFFIX)

fft3dfurth.o : fft3dfurth.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

fftw3d.o : fftw3d.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

wave_high.o : wave_high.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

radial.o : radial.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

symlib.o : symlib.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

symmetry.o : symmetry.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

wave_mpi.o : wave_mpi.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

wave.o : wave.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

dynbr.o : dynbr.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

asa.o : asa.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

broyden.o : broyden.F
	$(CPP)
	$(FC) -FR -lowercase -O2 -c $*$(SUFFIX)

us.o : us.F
	$(CPP)
	$(FC) -FR -lowercase -O1 -c $*$(SUFFIX)

LDApU.o : LDApU.F
	$(CPP)
	$(FC) -FR -lowercase -O2 -c $*$(SUFFIX)


Last edited by Gu Chenjie on Wed Dec 29, 2010 8:30 am, edited 1 time in total.

forsdan
Sr. Member
Posts: 339
Joined: Mon Apr 24, 2006 9:07 am
License Nr.: 173
Location: Gothenburg, Sweden

Segmentation fault

#2 Post by forsdan » Wed Dec 29, 2010 7:10 pm

1. Please note that mpif90 is just a wrapper compiler for MPI and does not compile or link applications itself. It only adds the required command-line flags and invokes the underlying Fortran 90 compiler. You can usually check the command line that would be executed to compile the program with "mpif90 -showme".
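For example, a minimal sketch (note that -showme is the OpenMPI spelling of this option; MPICH2, which you are using, spells it -show):

Code:

 mpif90 -showme    # OpenMPI: print the compile/link command the wrapper would run
 mpif90 -show      # MPICH2: same idea, different spelling

Either way, the output shows which Fortran compiler the wrapper invokes and which libraries it links against, so you can verify that it matches the ifort installation used for the rest of the build.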

2. In your previous thread I saw that you have switched from vasp 4.6 to vasp 5.2. Have you accounted for the fact that large arrays are now allocated on the stack in vasp 5.2, instead of on the heap as in vasp 4.6? Please see

http://cms.mpi.univie.ac.at/vasp-forum/ ... php?2.7459

If your stack size is too small, you will get a segmentation fault.
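A minimal sketch of how to check and lift the limit from bash before launching (the mpirun line is only a placeholder; adapt it to your actual launcher and core count):

Code:

 ulimit -s                # print the current stack size limit (in kB)
 ulimit -s unlimited      # lift the limit for this shell and its children
 mpirun -np 8 ./vasp      # placeholder launch line; adjust to your setup

In csh/tcsh the corresponding commands are "limit stacksize" and "limit stacksize unlimited".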

Otherwise, if this is accounted for, you should go through the standard procedure: check that the serial version works, that the linking is correct, and that the MPI libraries, compilers, and system are set up properly. A debugger is always helpful as well.
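As a rough starting point for those checks (assuming the binary is named vasp and sits in the run directory; both names are placeholders):

Code:

 ./vasp          # run the serial build on the same small test case first
 ldd ./vasp      # confirm every shared library (MKL, MPI, ...) resolves

If ldd reports "not found" for any entry, fix LD_LIBRARY_PATH before suspecting the source code or the input files.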

Cheers,
/Dan


Last edited by forsdan on Wed Dec 29, 2010 7:10 pm, edited 1 time in total.

Gu Chenjie
Newbie
Posts: 18
Joined: Thu Nov 25, 2010 5:41 am

Segmentation fault

#3 Post by Gu Chenjie » Thu Jan 06, 2011 7:01 am

Hi forsdan, thanks a lot for your reply.
Yes, if I set the stack size to unlimited, vasp.5 works on a single node, which contains 24 cores. However, when I try to link two nodes together to do the calculation, it fails:

Code:

Fatal error in MPI_Waitall: Other MPI error, error stack:
MPI_Waitall(261)..................: MPI_Waitall(count=46, req_array=0x7fffeeca46a0, status_array=0x7fffeeca4760) failed
MPIDI_CH3I_Progress(150)..........: 
MPID_nem_mpich2_blocking_recv(948): 
MPID_nem_tcp_connpoll(1709).......: Communication error
rank 23 in job 1  node0_55860   caused collective abort of all ranks
  exit status of rank 23: killed by signal 9

I think the problem still comes from the stack, even though I have already set the stack size on both of these two nodes to unlimited.

Thanks a lot,
Have a nice day.
Chenjie GU
Last edited by Gu Chenjie on Thu Jan 06, 2011 7:01 am, edited 1 time in total.
