From panda at cse.ohio-state.edu Fri Nov 7 01:52:08 2008 From: panda at cse.ohio-state.edu (Dhabaleswar Panda) Date: Fri Nov 7 01:52:24 2008 Subject: [mvapich] Announcing the release of MVAPICH2 1.2 Message-ID: You are receiving this e-mail because of: 1) being a member of more than 790 organizations world-wide who have downloaded MVAPICH/MVAPICH2 (High Performance MPI-1 and MPI-2 over InfiniBand and other RDMA-enabled interconnects) from the OSU web site and/or 2) your interest in the MVAPICH/MVAPICH2 package. The MVAPICH team is pleased to announce the availability of MVAPICH2-1.2 with the following NEW features: - Scalable and robust daemon-less job startup - Enhanced and robust mpirun_rsh framework (non-MPD-based) to provide scalable job launching on multi-thousand core clusters - Available for OpenFabrics (IB and iWARP) and uDAPL interfaces (including Solaris) - Support for Totalview debugger - Checkpoint-restart with intra-node shared memory support - Allows best performance and scalability with fault-tolerance support - Enhancement to software installation - Full autoconf-based configuration - Automatically detects system architecture and adapter types and optimizes MVAPICH2 for any particular installation - An application (mpiname) for querying the MVAPICH2 library version and configuration information - Enhanced processor affinity using PLPA for multi-core architectures - Allows user-defined flexible processor affinity - Enhanced scalability for RDMA-based direct one-sided communication with less communication resource - Available for OpenFabrics (IB and iWARP) interfaces - Shared memory optimized algorithm for MPI_Bcast operation - Optimized and tuned MPI_Alltoall - Based on MPICH2 1.0.7 More details on all features and supported platforms can be obtained by visiting the following URL: http://mvapich.cse.ohio-state.edu/overview/mvapich2/features.shtml MVAPICH2 1.2 is being made available with OFED 1.4. It is also tested with OFED 1.3. It continues to deliver excellent performance. Sample performance numbers include: OpenFabrics/Gen2 on EM64T quad-core with PCIe-Gen2 and ConnectX-QDR: Two-sided operations: - 1.25 microsec one-way latency (4 bytes) - 2573 MB/sec unidirectional bandwidth - 5037 MB/sec bidirectional bandwidth One-sided operations: - 2.73 microsec Put latency (4 bytes) - 2576 MB/sec unidirectional Put bandwidth - 4921 MB/sec bidirectional Put bandwidth Performance numbers for several other platforms, system configurations and operations can be viewed by visiting `Performance' section of the project's web page. For downloading MVAPICH2 1.2 package and accessing the anonymous SVN, please visit the following URL: http://mvapich.cse.ohio-state.edu/ All feedbacks, including bug reports, hints for performance tuning, patches and enhancements are welcome. Please post it to the mvapich-discuss mailing list. Thanks, The MVAPICH Team From panda at cse.ohio-state.edu Fri Nov 14 22:52:30 2008 From: panda at cse.ohio-state.edu (Dhabaleswar Panda) Date: Fri Nov 14 22:52:44 2008 Subject: [mvapich] Announcing the release of MVAPICH 1.1 Message-ID: You are receiving this e-mail because of: 1) being a member of more than 800 organizations world-wide who have downloaded MVAPICH/MVAPICH2 (High Performance MPI-1 and MPI-2 over InfiniBand and other RDMA-enabled interconnects) from the OSU web site and/or 2) your interest in the MVAPICH/MVAPICH2 package. The MVAPICH team is pleased to announce the availability of MVAPICH-1.1 with the following NEW features: - New Features for OpenFabrics Gen2-IB Interface - eXtended Reliable Connection (XRC) support - Lock-free design to provide support for asynchronous progress at both sender and receiver to overlap computation and communication - Optimized MPI_allgather collective - Efficient intra-node shared memory communication support for diskless clusters - Enhanced Totalview Support with the new mpirun_rsh framework - New OpenFabrics Gen2-Hybrid Interface - Replaces the Gen2-UD interface of MVAPICH 1.0 series - Targeted for large-scale IB clusters (multi-thousand cores) to provide highest performance and minimal memory usage - Support for UD, RC and XRC transports - Adaptive selection during run-time (based on application and systems characteristics) to switch between RC and UD (or between XRC and UD) transports - Delivers performance and scalability with near constant memory footprint for communication contexts - Zero-copy protocol with UD for large data transfer - Multiple buffer organizations with XRC support - Shared memory communication between cores within a node - Efficient intra-node shared memory communication support for diskless clusters - Multi-core optimized collectives (MPI_Bcast, MPI_Barrier, MPI_Reduce and MPI_Allreduce) - Optimized MPI_Allgather collective - Enhanced Totalview Support with the new mpirun_rsh framework - New Features for MVAPICH-InfiniPath (QLogic) Interface - Enhanced Totalview Support with the new mpirun_rsh framework - New Features for Shared-Memory only Interface - Enhanced Totalview Support with the new mpirun_rsh framework More details on all features and supported platforms can be obtained by visiting the following URL: http://mvapich.cse.ohio-state.edu/overview/mvapich/features.shtml MVAPICH 1.1 is being made available with OFED 1.4. It is also tested with OFED 1.3. It continues to deliver excellent performance. Sample performance numbers include: OpenFabrics/Gen2-IB on EM64T quad-core with PCIe2 and ConnectX-QDR: - 1.17 microsec one-way latency (4 bytes) - 2569 MB/sec unidirectional bandwidth - 5025 MB/sec bidirectional bandwidth OpenFabrics/Gen2-Hybrid on EM64T quad-core with PCIe2 and ConnectX-QDR: - 1.18 microsec one-way latency (4 bytes) - 2571 MB/sec unidirectional bandwidth - 5027 MB/sec bidirectional bandwidth OpenFabrics/Gen2-IB on Opteron quad-core with PCIe and ConnectX-DDR: - 1.62 microsec one-way latency (4 bytes) - 1628 MB/sec unidirectional bandwidth - 2889 MB/sec bidirectional bandwidth InfiniPath on EM64T quad-core with PCIe2 and QLogic-DDR: - 1.28 microsec one-way latency (4 bytes) - 1953 MB/sec unidirectional bandwidth Performance numbers for several other platforms, system configurations and operations can be viewed by visiting `Performance' section of the project's web page. For downloading MVAPICH 1.1 package and accessing the anonymous SVN, please visit the following URL: http://mvapich.cse.ohio-state.edu/ All feedbacks, including bug reports, hints for performance tuning, patches and enhancements are welcome. Please post it to the mvapich-discuss mailing list. Thanks, The MVAPICH Team