skip to main content
10.1145/76263acmconferencesBook PagePublication PagesscConference Proceedingsconference-collections
Supercomputing '89: Proceedings of the 1989 ACM/IEEE conference on Supercomputing
ACM1989 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
SC '89: International Conference for High Performance Computing, Networking, Storage and Analysis Reno Nevada USA November 12 - 17, 1989
ISBN:
978-0-89791-341-6
Published:
01 August 1989
Sponsors:
SIGARCH, IEEE-CS
In-Cooperation:
Los Alamos National Labs, NASA, Argonne Natl Lab

Bibliometrics
Abstract

No abstract available.

Proceeding Downloads

Article
QCD with dynamical fermions on the connection machine

We have implemented Quantum Chromo-Dynamics (QCD) on the massively parallel Connection Machine in *Lisp. The code uses dynamical Wilson fermions and the Hybrid Monte Carlo Algorithm (HMCA) to update the lattice. We describe our program and give ...

Article
Free
Vectorization on Monte Carlo particle transport: an architectural study using the LANL benchmark “GAMTEB”

Fully vectorized versions of the Los Alamos National Laboratory benchmark code Gamteb, a Monte Carlo photon transport algorithm, were developed for the Cyber 205/ETA-10 and Cray X-MP/Y-MP architectures. Single-processor performance measurements of the ...

Article
Free
Parallelizing a large scientific code - methods, issues, and concerns

Objectives of this study were to develop techniques and methods for effective analysis of large codes; to determine the feasibility of parallelizing an existing large scientific code; and to estimate potential speedups attainable, and associated ...

Article
Free
Benchmark calculations with an unstructured grid flow solver on a SIMD computer

An unstructured grid flow solver was implemented on a massively parallel computer, and benchmark computations were performed. The solver was a two-dimensional computational fluid dynamics (CFD) code that performs first-order, steady-state solutions of ...

Article
Free
Implementation of a hypersonic rarefied flow particle simulation on the connection machine

A very efficient direct particle simulation algorithm for hypersonic rarefied flows is presented and its implementation on a Connection Machine is described. The implementation is capable of simulating up to 4 x 106 hard sphere diatomic molecules using ...

Article
Free
Computational aerothermodynamics

Aerothermodynamics is defined1 as “the study of the relationship of heat and mechanical energy in gases, especially air”. To those familiar with fluid dynamics (the study of the flow properties of liquids and gases) this means that we must consider ...

    Article
    Free
    Practical parallel supercomputing: examples from chemistry and physics

    We use two large simulations, the chemical reaction dynamics of H + H2 and the collision of two galaxies to show that current parallel machines are capable of large supercomputer level calculations. We contrast the different architectural tradeoffs for ...

    Article
    Free
    Capability of current supercomputers for the computational fluid dynamics

    The computer code named LANS3D, one of the representative Navier-Stokes codes in Japan, is taken as a example and the capability of the current CFD technology is discussed. This code was developed for the numerical simulation of high-Reynolds number ...

    Article
    Supercomputing of circuits simulation

    The Circuit Analysis is very important in the development of LSI. We have been conducting the speed-up of the circuit analysis program SPICE-GT. SPICE-GT is based on the SPICE 2G.6 program from the University of California Berkely. We achieved speed-up ...

    Article
    Free
    Computations of soil temperature rise due to HVDC ground return

    The purpose of this paper is to present an application which historically, did not make use of computing methodology in the solution of design problems. The design of High Voltage Direct Current (HVDC) ground electrodes involves the careful selection of ...

    Article
    Free
    A radar simulation program for a 1024-processor hypercube

    We have developed a fast parallel version of an existing synthetic aperture radar (SAR) simulation program, SRIM. On a 1024-processor NCUBE hypercube it runs an order of magnitude faster than on a CRAY X-MP or CRAY Y-MP processor. This speed advantage ...

    Article
    Free
    Parallel MIMD programming for global models of atmospheric flow

    Modeling atmospheric flow is one application of supercomputers. In this paper we present some concepts for implementing global flow algorithms on shared memory multiprocessors. We describe how an analysis of the algorithms combined with the appropriate ...

    Article
    Free
    Computational fluid dynamic-current capabilities and directions for the future

    Computational fluid dynamics (CFD) has made great strides in the detailed simulation of complex fluid flows, including some of those not before understood. It is now being routinely applied to some rather complicated problems, and starting to impact the ...

    Article
    Free
    Parallel algorithm and VLSI architecture for a robot's inverse kinematics

    The inverse solutions of a robotic systems are generally produced by a serial process. Due to the computing time of processing geometry data and generating an inverse solution corresponding to a specified point in Cartesian trajectory is larger than the ...

    Article
    Free
    Supercomputers in computational ocean acoustics

    In this paper, we report on some computational experience in solving ocean acoustic propagation problems in three dimensions on supercomputers. The underlying Helmholtz equation is transformed into a parabolic-type equation in the Lee-Saad-Schultz model ...

    Article
    Free
    A study of dissipation operators for the euler equations and a three- dimensional channel flow

    Explicit methods for the solution of fluid flow problems are of considerable interest in supercomputing. These methods parallelize well. The treatment of the boundaries is of particular interest both with respect to the numeric behavior of the solution, ...

    Article
    Free
    A computer assisted optimal depth lower bound for sorting networks with nine inputs

    It is demonstrated that there is no nine-input sorting network of depth six. The proof was obtained by executing on a supercomputer a branch-and-bound algorithm which constructs and tests a critical subset of all possible candidates. Such proofs can be ...

    Article
    Free
    Realities associated with parallel processing

    At the T. J. Watson Research Center, there is a very active Condensed Matter Physics Group engaged in the study of semiconductors such as silicon (Si) and gallium-arsenide (Ga-As)1. One of the most important computer codes developed at Watson is a ...

    Article
    Free
    How a SIMD machine can implement a complex cellular automata? a case study: von Neumann's 29-state cellular automaton

    This study is a part of an effort to simulate the 29-state self-reproducing cellular automaton described by John von Neumann in a manuscript that dates back to 1952. We are interested in the programming of very large SIMD arrays which, as a consequence ...

    Article
    Free
    Automatic vectorization of character string manipulation and relational operations in Pascal

    In our paper of Supercomputing '88, an overview of V-Pascal, an automatic vectorizing compiler for Pascal, was presented with focus on its Version 1. In that paper, as one of those higher functions to be added to Version 2 V-Pascal, vector-mode ...

    Article
    Free
    Neural network simulation on shared-memory vector multiprocessors

    We simulate three neural networks on a vector multiprocrssor. The training time can be reduced significantly especially when the training data size is large. These three neural networks are: 1) the feedforward network, 2) the recurrent network and 3) ...

    Article
    Free
    Concurrent and vectorized Monte Carlo simulation of the evolution of an assembly of particles increasing in number

    Parallel Monte Carlo techniques for simulating the evolution of an assembly of charged particles interacting with a background gas medium under the influence of the electrical field are presented. This simulation problem has inherent parallelism in ...

    Article
    Free
    Protein structure prediction by a data-level parallel algorithm

    We have developed a software system, PHI-PSI, on the Connection Machine that uses a parallel algorithm to retrieve and use information from a database of 112 known protein structures (selected from the Brookhaven Protein Databank) to predict the ...

    Article
    Free
    Vector and parallel algorithms for Cholesky factorization on IBM 3090

    In many engineering applications, a solution of Fx = b is required, where F is a positive definite symmetric matrix. This is usually done by the Cholesky factorization, F = RRT, where R is the lower triangular Cholesky factor. This is a compute ...

    Article
    Free
    FFTs in external of hierarchical memory

    Conventional algorithms for computing large one-dimensional fast Fourier transforms (FFTs), even those algorithms recently developed for vector and parallel computers, are largely unsuitable for systems with external or hierarchical memory. The ...

    Article
    Free
    Macrotasking the singluar value decomposition of block circulant matrices on the Cray-2

    A parallel algorithm to compute the singular value decomposition (SVD) of block circulant matrices on the Cray-2 is described. For a block circulant form described by M blocks with m x n elements in each block, the computation time using an SVD ...

    Article
    Free
    A block QR factorization algorithm using restricted pivoting

    This paper presents a new algorithm for computing the QR factorization of a rank-deficient matrix on high-performance machines. The algorithm is based on the Householder QR factorization algorithm with column pivoting. The traditional pivoting strategy ...

    Article
    Free
    Tuning the rank-n update in a wavefront solver for peak performance

    The wavefront solver is a type of linear equation solver that is suitable for solving the system of linear equations that arises in many finite-element applications. A new version of the wavefront solver was recently introduced into the ANSYS® program ...

    Article
    Free
    Load balancing and task decomposition techniques for parallel implementation of integrated vision systems algorithms

    Integrated vision systems employ a sequence of image understanding algorithms in which the output of an algorithm is the input of the next algorithm in the sequence. Algorithms that constitute an integrated vision systems exhibit different ...

    Article
    Free
    Efficient computation of the singular value decomposition on cube connected SIMD machine

    The singular value decomposition (SVD) has many real-time applications. Recently, there has been much interest in developing efficient methods to compute SVD in parallel machines. This paper presents an efficient method for computing SVD in a cube ...

    Contributors
    • NASA Ames Research Center

    Recommendations

    Acceptance Rates

    Overall Acceptance Rate1,516of6,373submissions,24%
    YearSubmittedAcceptedRate
    SC '173276119%
    SC '164428118%
    SC '153587922%
    SC '143948321%
    SC '134499120%
    SC '1246110022%
    SC '113527421%
    SC '102535120%
    SC '092615923%
    SC '082775921%
    SC '072685420%
    SC '062395423%
    SC '052606224%
    SC '042006030%
    SC '032076029%
    SC '022306729%
    SC '012406025%
    SC '001796235%
    Supercomputing '952416929%
    Supercomputing '933007224%
    Supercomputing '922207534%
    Supercomputing '912158339%
    Overall6,3731,51624%