From panda at cse.ohio-state.edu Fri Sep 14 19:05:02 2007 From: panda at cse.ohio-state.edu (Dhabaleswar Panda) Date: Fri Sep 14 19:20:33 2007 Subject: [mvapich] Announcing the release of MVAPICH2 1.0 Message-ID: <200709142305.l8EN52u3023352@xi.cse.ohio-state.edu> You are receiving this e-mail because of: 1) being a member of more than 555 organizations world-wide who have downloaded MVAPICH/MVAPICH2 (High Performance MPI-1 and MPI-2 over InfiniBand and other RDMA-enabled interconnects) from the OSU web site and/or 2) your interest in the MVAPICH/MVAPICH2 package. The MVAPICH team is pleased to announce the availability of MVAPICH2-1.0 with the following NEW features: - Message coalescing support to enable reduction of per Queue-pair send queues for reduction in memory requirement on large scale clusters. This design also increases the small message messaging rate significantly. Available for Open Fabrics Gen2-IB. - Hot-Spot Avoidance Mechanism (HSAM) for alleviating network congestion in large scale clusters. Available for Open Fabrics Gen2-IB. - RDMA CM based on-demand connection management for large scale clusters. Available for OpenFabrics Gen2-IB and Gen2-iWARP. - uDAPL on-demand connection management for large scale clusters. Available for uDAPL interface (including Solaris IB implementation). - RDMA Read support for increased overlap of computation and communication. Available for OpenFabrics Gen2-IB and Gen2-iWARP. - Application-initiated system-level (synchronous) check-pointing in addition to the user-transparent check-pointing. User application can now request a whole program checkpoint synchronously with BLCR by calling special functions within the application. Available for OpenFabrics Gen2-IB. - Network-Level fault tolerance with Automatic Path Migration (APM) for tolerating intermittent network failures over InfiniBand. Available for OpenFabrics Gen2-IB. - Integrated multi-rail communication support for OpenFabrics Gen2-iWARP and RDMA CM (with Gen2-IB). - RDMA based Direct One-sided communication support for OpenFabrics Gen2-iWARP and RDMA CM (with Gen2-IB). - Blocking mode of communication progress. Available for OpenFabrics Gen2-IB. - Based on MPICH2 1.0.5p4. More details on all features and supported platforms can be obtained by visiting the following URL: http://mvapich.cse.ohio-state.edu/overview/mvapich2/features.shtml MVAPICH2 1.0 is tested with OFED 1.1, OFED 1.2 and OFED 1.2.5 (for ConnectX). It continues to deliver excellent performance. Sample performance numbers include: OpenFabrics/Gen2 on EM64T quad-core with PCIe and ConnectX-DDR: Two-sided operations: - 1.66 microsec one-way latency (4 bytes) - 1405 MB/sec unidirectional bandwidth - 2716 MB/sec bidirectional bandwidth One-sided operations: - 3.19 microsec Put latency - 1405 MB/sec unidirectional Put bandwidth - 2716 MB/sec bidirectional Put bandwidth Performance numbers for all other platforms, system configurations and operations can be viewed by visiting `Performance' section of the project's web page. For downloading MVAPICH2 1.0 package and accessing the anonymous SVN, please visit the following URL: http://mvapich.cse.ohio-state.edu/ All feedbacks, including bug reports, hints for performance tuning, patches and enhancements are welcome. Please post it to mvapich-discuss mailing list. Thanks, MVAPICH Team at OSU/NBCL -------- PS: If you would like to be removed from this mailing list, please send an e-mail to mvapich_request@cse.ohio-state.edu. ====================================================================== MVAPICH/MVAPICH2 project is currently supported with funding from U.S. National Science Foundation, U.S. DOE Office of Science, Mellanox, Intel, Cisco Systems, QLogic, Sun Microsystems and Linux Networx; and with equipment support from Advanced Clustering, AMD, Appro, Chelsio, Dell, Fujitsu, Fulcrum, IBM, Intel, Mellanox, Microway, NetEffect, QLogic and Sun Microsystems. Other technology partner includes Etnus. ======================================================================