2,757 research outputs found
Counting reducible, powerful, and relatively irreducible multivariate polynomials over finite fields
We present counting methods for some special classes of multivariate
polynomials over a finite field, namely the reducible ones, the s-powerful ones
(divisible by the s-th power of a nonconstant polynomial), and the relatively
irreducible ones (irreducible but reducible over an extension field). One
approach employs generating functions, another one uses a combinatorial method.
They yield exact formulas and approximations with relative errors that
essentially decrease exponentially in the input size.Comment: to appear in SIAM Journal on Discrete Mathematic
Benchmarking computer platforms for lattice QCD applications
We define a benchmark suite for lattice QCD and report on benchmark results
from several computer platforms. The platforms considered are apeNEXT, CRAY
T3E, Hitachi SR8000, IBM p690, PC-Clusters, and QCDOC.Comment: 3 pages, Lattice03, machines and algorithm
Better than $1/Mflops sustained: a scalable PC-based parallel computer for lattice QCD
We study the feasibility of a PC-based parallel computer for medium to large
scale lattice QCD simulations. The E\"otv\"os Univ., Inst. Theor. Phys. cluster
consists of 137 Intel P4-1.7GHz nodes with 512 MB RDRAM. The 32-bit, single
precision sustained performance for dynamical QCD without communication is 1510
Mflops/node with Wilson and 970 Mflops/node with staggered fermions. This gives
a total performance of 208 Gflops for Wilson and 133 Gflops for staggered QCD,
respectively (for 64-bit applications the performance is approximately halved).
The novel feature of our system is its communication architecture. In order to
have a scalable, cost-effective machine we use Gigabit Ethernet cards for
nearest-neighbor communications in a two-dimensional mesh. This type of
communication is cost effective (only 30% of the hardware costs is spent on the
communication). According to our benchmark measurements this type of
communication results in around 40% communication time fraction for lattices
upto 48^3\cdot96 in full QCD simulations. The price/sustained-performance ratio
for full QCD is better than 1.5/Mflops for
staggered) quarks for practically any lattice size, which can fit in our
parallel computer. The communication software is freely available upon request
for non-profit organizations.Comment: 14 pages, 3 figures, final version to appear in Comp.Phys.Com
Cracking in asphalt materials
This chapter provides a comprehensive review of both laboratory characterization and modelling of bulk material fracture in asphalt mixtures. For the purpose of organization, this chapter is divided into a section on laboratory tests and a section on models. The laboratory characterization section is further subdivided on the basis of predominant loading conditions (monotonic vs. cyclic). The section on constitutive models is subdivided into two sections, the first one containing fracture mechanics based models for crack initiation and propagation that do not include material degradation due to cyclic loading conditions. The second section discusses phenomenological models that have been developed for crack growth through the use of dissipated energy and damage accumulation concepts. These latter models have the capability to simulate degradation of material capacity upon exceeding a threshold number of loading cycles.Peer ReviewedPostprint (author's final draft
Hardware and software status of QCDOC
QCDOC is a massively parallel supercomputer whose processing nodes are based
on an application-specific integrated circuit (ASIC). This ASIC was
custom-designed so that crucial lattice QCD kernels achieve an overall
sustained performance of 50% on machines with several 10,000 nodes. This strong
scalability, together with low power consumption and a price/performance ratio
of $1 per sustained MFlops, enable QCDOC to attack the most demanding lattice
QCD problems. The first ASICs became available in June of 2003, and the testing
performed so far has shown all systems functioning according to specification.
We review the hardware and software status of QCDOC and present performance
figures obtained in real hardware as well as in simulation.Comment: Lattice2003(machine), 6 pages, 5 figure
Statistical features of edge turbulence in RFX-mod from Gas Puffing Imaging
Plasma density fluctuations in the edge plasma of the RFX-mod device are
measured through the Gas Puffing Imaging Diagnostics. Statistical features of
the signal are quantified in terms of the Probability Distribution Function
(PDF), and computed for several kinds of discharges. The PDFs from discharges
without particular control methods are found to be adequately described by a
Gamma function, consistently with the recent results by Graves et al [J.P.
Graves, et al, Plasma Phys. Control. Fusion 47, L1 (2005)]. On the other hand,
pulses with external methods for plasma control feature modified PDFs. A first
empirical analysis suggests that they may be interpolated through a linear
combination of simple functions. An inspection of the literature shows that
this kind of PDFs is common to other devices as well, and has been suggested to
be due to the simultaneous presence of different mechanisms driving
respectively coherent bursts and gaussian background turbulence. An attempt is
made to relate differences in the PDFs to plasma conditions such as the local
shift of the plasma column. A simple phenomenological model to interpret the
nature of the PDF and assign a meaning to its parameters is also developed.Comment: 27 pages. Published in PPC
APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters
We describe herein the APElink+ board, a PCIe interconnect adapter featuring
the latest advances in wire speed and interface technology plus hardware
support for a RDMA programming model and experimental acceleration of GPU
networking; this design allows us to build a low latency, high bandwidth PC
cluster, the APEnet+ network, the new generation of our cost-effective,
tens-of-thousands-scalable cluster network architecture. Some test results and
characterization of data transmission of a complete testbench, based on a
commercial development card mounting an Altera FPGA, are provided.Comment: 6 pages, 7 figures, proceeding of CHEP 2010, Taiwan, October 18-2
- …
