<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.6000.16640" name=GENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2> Hello
all:</FONT></DIV>
<DIV><FONT face=Arial size=2> </FONT></DIV>
<DIV><FONT face=Arial size=2> Good morning!
</FONT></DIV>
<DIV><FONT face=Arial size=2> I set up a
64-core cluster based on ROCKS-5.0 using eight Dell PE2900 boxes.</FONT></DIV>
<DIV><FONT face=Arial size=2> All are
dual-processor QC machines.</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2> compiled
MVAPICH-1.0 (using intel compiler) with default parameters in
make.mvapich.gen2</FONT></DIV>
<DIV><FONT face=Arial size=2> IB stack is
OFED-1.2.5.5. </FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2> My MD
program (PMEMD/AMBER) is compiled with no errors with IFORT/MKL libraries
and</FONT></DIV>
<DIV><FONT face=Arial size=2> I could run
the code on all 64 cores, but the scaling from 16 to 32 to 64 is terrible. I am
inclosing</FONT></DIV>
<DIV><FONT face=Arial size=2> the benchmarks
on a test run.</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial
size=2> <FONT
face="Times New Roman" size=3> # of CPUs/cores Time
(sec) Nodes
(load-balanced) Scaling
(%)<BR>
8
82
8 100
<BR>
16
49
8
84
<BR>
32
42
8 49<BR>
64
39
8
26</FONT><BR></FONT></DIV>
<DIV><FONT face=Arial
size=2> In contrast,
on single box, I get a reasonable scaling.</FONT></DIV><FONT face=Arial
size=2><FONT face="Times New Roman" size=3><FONT face=Arial size=2></FONT>
<DIV><FONT face=Arial size=2></FONT><BR> #
cores time
(sec)<BR>
2
284
(100%)<BR>
4
164 (87%<BR>
8
107 (65%)<BR><BR> For some
reason, I suspect, MPI traffic is not going over IB net.</FONT><BR></DIV></FONT>
<DIV><FONT face=Arial
size=2> <FONT
face="Times New Roman" size=3>MVAPICH is built using make.mvapich.gen2 with
F77=ifort and CC=gcc<BR><BR> mpif77 -link_info
is:<BR><BR> /state/partition1/fc91052/bin/ifort
-L/usr/local/ofed/lib64 -L/usr/local/mvapich/lib <BR>
-lmpich -L/usr/local/ofed/lib64 -Wl,-rpath=/usr/local/ofed/lib64
-libverbs <BR> -libumad -lpthread -lpthread
-lrt</FONT><BR><BR></FONT></DIV>
<DIV><FONT face=Arial size=2> How can I be sure that MPI
traffic is going through IB network rather than ethernet?</FONT></DIV>
<DIV><FONT face=Arial size=2> Are there any specific
checks I should perform?</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2> Thanks a lot for your
help.</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT> </DIV>
<DIV><FONT face=Arial size=2> Divi</FONT></DIV>
<DIV><FONT face=Arial size=2> </DIV></FONT></BODY></HTML>