2,757 research outputs found

    Counting reducible, powerful, and relatively irreducible multivariate polynomials over finite fields

    Full text link
    We present counting methods for some special classes of multivariate polynomials over a finite field, namely the reducible ones, the s-powerful ones (divisible by the s-th power of a nonconstant polynomial), and the relatively irreducible ones (irreducible but reducible over an extension field). One approach employs generating functions, another one uses a combinatorial method. They yield exact formulas and approximations with relative errors that essentially decrease exponentially in the input size.Comment: to appear in SIAM Journal on Discrete Mathematic

    Benchmarking computer platforms for lattice QCD applications

    Full text link
    We define a benchmark suite for lattice QCD and report on benchmark results from several computer platforms. The platforms considered are apeNEXT, CRAY T3E, Hitachi SR8000, IBM p690, PC-Clusters, and QCDOC.Comment: 3 pages, Lattice03, machines and algorithm

    Better than $1/Mflops sustained: a scalable PC-based parallel computer for lattice QCD

    Full text link
    We study the feasibility of a PC-based parallel computer for medium to large scale lattice QCD simulations. The E\"otv\"os Univ., Inst. Theor. Phys. cluster consists of 137 Intel P4-1.7GHz nodes with 512 MB RDRAM. The 32-bit, single precision sustained performance for dynamical QCD without communication is 1510 Mflops/node with Wilson and 970 Mflops/node with staggered fermions. This gives a total performance of 208 Gflops for Wilson and 133 Gflops for staggered QCD, respectively (for 64-bit applications the performance is approximately halved). The novel feature of our system is its communication architecture. In order to have a scalable, cost-effective machine we use Gigabit Ethernet cards for nearest-neighbor communications in a two-dimensional mesh. This type of communication is cost effective (only 30% of the hardware costs is spent on the communication). According to our benchmark measurements this type of communication results in around 40% communication time fraction for lattices upto 48^3\cdot96 in full QCD simulations. The price/sustained-performance ratio for full QCD is better than 1/MflopsforWilson(andaround1/Mflops for Wilson (and around 1.5/Mflops for staggered) quarks for practically any lattice size, which can fit in our parallel computer. The communication software is freely available upon request for non-profit organizations.Comment: 14 pages, 3 figures, final version to appear in Comp.Phys.Com

    Cracking in asphalt materials

    Get PDF
    This chapter provides a comprehensive review of both laboratory characterization and modelling of bulk material fracture in asphalt mixtures. For the purpose of organization, this chapter is divided into a section on laboratory tests and a section on models. The laboratory characterization section is further subdivided on the basis of predominant loading conditions (monotonic vs. cyclic). The section on constitutive models is subdivided into two sections, the first one containing fracture mechanics based models for crack initiation and propagation that do not include material degradation due to cyclic loading conditions. The second section discusses phenomenological models that have been developed for crack growth through the use of dissipated energy and damage accumulation concepts. These latter models have the capability to simulate degradation of material capacity upon exceeding a threshold number of loading cycles.Peer ReviewedPostprint (author's final draft

    Hardware and software status of QCDOC

    Full text link
    QCDOC is a massively parallel supercomputer whose processing nodes are based on an application-specific integrated circuit (ASIC). This ASIC was custom-designed so that crucial lattice QCD kernels achieve an overall sustained performance of 50% on machines with several 10,000 nodes. This strong scalability, together with low power consumption and a price/performance ratio of $1 per sustained MFlops, enable QCDOC to attack the most demanding lattice QCD problems. The first ASICs became available in June of 2003, and the testing performed so far has shown all systems functioning according to specification. We review the hardware and software status of QCDOC and present performance figures obtained in real hardware as well as in simulation.Comment: Lattice2003(machine), 6 pages, 5 figure

    Statistical features of edge turbulence in RFX-mod from Gas Puffing Imaging

    Get PDF
    Plasma density fluctuations in the edge plasma of the RFX-mod device are measured through the Gas Puffing Imaging Diagnostics. Statistical features of the signal are quantified in terms of the Probability Distribution Function (PDF), and computed for several kinds of discharges. The PDFs from discharges without particular control methods are found to be adequately described by a Gamma function, consistently with the recent results by Graves et al [J.P. Graves, et al, Plasma Phys. Control. Fusion 47, L1 (2005)]. On the other hand, pulses with external methods for plasma control feature modified PDFs. A first empirical analysis suggests that they may be interpolated through a linear combination of simple functions. An inspection of the literature shows that this kind of PDFs is common to other devices as well, and has been suggested to be due to the simultaneous presence of different mechanisms driving respectively coherent bursts and gaussian background turbulence. An attempt is made to relate differences in the PDFs to plasma conditions such as the local shift of the plasma column. A simple phenomenological model to interpret the nature of the PDF and assign a meaning to its parameters is also developed.Comment: 27 pages. Published in PPC

    APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters

    Full text link
    We describe herein the APElink+ board, a PCIe interconnect adapter featuring the latest advances in wire speed and interface technology plus hardware support for a RDMA programming model and experimental acceleration of GPU networking; this design allows us to build a low latency, high bandwidth PC cluster, the APEnet+ network, the new generation of our cost-effective, tens-of-thousands-scalable cluster network architecture. Some test results and characterization of data transmission of a complete testbench, based on a commercial development card mounting an Altera FPGA, are provided.Comment: 6 pages, 7 figures, proceeding of CHEP 2010, Taiwan, October 18-2
    corecore