2,322 research outputs found
Improved Algorithms for Approximate String Matching (Extended Abstract)
The problem of approximate string matching is important in many different
areas such as computational biology, text processing and pattern recognition. A
great effort has been made to design efficient algorithms addressing several
variants of the problem, including comparison of two strings, approximate
pattern identification in a string or calculation of the longest common
subsequence that two strings share.
We designed an output sensitive algorithm solving the edit distance problem
between two strings of lengths n and m respectively in time
O((s-|n-m|)min(m,n,s)+m+n) and linear space, where s is the edit distance
between the two strings. This worst-case time bound sets the quadratic factor
of the algorithm independent of the longest string length and improves existing
theoretical bounds for this problem. The implementation of our algorithm excels
also in practice, especially in cases where the two strings compared differ
significantly in length. Source code of our algorithm is available at
http://www.cs.miami.edu/\~dimitris/edit_distanceComment: 10 page
Measuring Global Similarity between Texts
We propose a new similarity measure between texts which, contrary to the
current state-of-the-art approaches, takes a global view of the texts to be
compared. We have implemented a tool to compute our textual distance and
conducted experiments on several corpuses of texts. The experiments show that
our methods can reliably identify different global types of texts.Comment: Submitted to SLSP 201
Elastic properties of grafted microtubules
We use single-particle tracking to study the elastic properties of single
microtubules grafted to a substrate. Thermal fluctuations of the free
microtubule's end are recorded, in order to measure position distribution
functions from which we calculate the persistence length of microtubules with
contour lengths between 2.6 and 48 micrometers. We find the persistence length
to vary by more than a factor of 20 over the total range of contour lengths.
Our results support the hypothesis that shearing between protofilaments
contributes significantly to the mechanics of microtubules.Comment: 9 pages, 3 figure
MC64: A web platform to test bioinformatics algorithms in a many-core architecture
New analytical methodologies, like the so-called "next-generation sequencing" (NGS), allow the sequencing of full genomes with high speed and reduced price. Yet, such technologies generate huge amounts of data that demand large raw computational power. Many-core technologies can be exploited to overcome the involved bioinformatics bottleneck. Indeed, such hardware is currently in active development. We have developed parallel bioinformatics algorithms for many-core microprocessors containing 64 cores each. Thus, the MC64 web platform allows executing high-performance alignments (Needleman-Wunsch, Smith-Waterman and ClustalW) of long sequences. The MC64 platform can be accessed via web browsers, allowing easy resource integration into third-party tools. Furthermore, the results obtained from the MC64 include time-performance statistics that can be compared with other platform
Generalized Interpolation Material Point Approach to High Melting Explosive with Cavities Under Shock
Criterion for contacting is critically important for the Generalized
Interpolation Material Point(GIMP) method. We present an improved criterion by
adding a switching function. With the method dynamical response of high melting
explosive(HMX) with cavities under shock is investigated. The physical model
used in the present work is an elastic-to-plastic and thermal-dynamical model
with Mie-Gr\"uneissen equation of state. We mainly concern the influence of
various parameters, including the impacting velocity , cavity size , etc,
to the dynamical and thermodynamical behaviors of the material. For the
colliding of two bodies with a cavity in each, a secondary impacting is
observed. Correspondingly, the separation distance of the two bodies has a
maximum value in between the initial and second impacts. When the
initial impacting velocity is not large enough, the cavity collapses in a
nearly symmetric fashion, the maximum separation distance increases
with . When the initial shock wave is strong enough to collapse the cavity
asymmetrically along the shock direction, the variation of with
does not show monotonic behavior. Our numerical results show clear indication
that the existence of cavities in explosive helps the creation of ``hot
spots''.Comment: Figs.2,4,7,11 in JPG format; Accepted for publication in J. Phys. D:
Applied Physic
Crack-Like Processes Governing the Onset of Frictional Slip
We perform real-time measurements of the net contact area between two blocks
of like material at the onset of frictional slip. We show that the process of
interface detachment, which immediately precedes the inception of frictional
sliding, is governed by three different types of detachment fronts. These
crack-like detachment fronts differ by both their propagation velocities and by
the amount of net contact surface reduction caused by their passage. The most
rapid fronts propagate at intersonic velocities but generate a negligible
reduction in contact area across the interface. Sub-Rayleigh fronts are
crack-like modes which propagate at velocities up to the Rayleigh wave speed,
VR, and give rise to an approximate 10% reduction in net contact area. The most
efficient contact area reduction (~20%) is precipitated by the passage of slow
detachment fronts. These fronts propagate at anomalously slow velocities, which
are over an order of magnitude lower than VR yet orders of magnitude higher
than other characteristic velocity scales such as either slip or loading
velocities. Slow fronts are generated, in conjunction with intersonic fronts,
by the sudden arrest of sub-Rayleigh fronts. No overall sliding of the
interface occurs until either of the slower two fronts traverses the entire
interface, and motion at the leading edge of the interface is initiated. Slip
at the trailing edge of the interface accompanies the motion of both the slow
and sub-Rayleigh fronts. We might expect these modes to be important in both
fault nucleation and earthquake dynamics.Comment: 19 page, 5 figures, to appear in International Journal of Fractur
Residual cognitive deficits 50 years after lead poisoning during childhood
The long term neurobehavioural consequences of childhood lead poisoning are not known. In this study adult subjects with a documented history of lead poisoning before age 4 and matched controls were examined with an abbreviated battery of neuropsychological tests including measures of attention, reasoning, memory, motor speed, and current mood. The subjects exposed to lead were inferior to controls on almost all of the cognitive tasks. This pattern of widespread deficits resembles that found in children evaluated at the time of acute exposure to lead rather than the more circumscribed pattern typically seen in adults exposed to lead. Despite having completed as many years of schooling as controls, the subjects exposed to lead were lower in lifetime occupational status. Within the exposed group, performance on the neuropsychological battery and occupational status were related, consistent with the presumed impact of limitations in neuropsychological functioning on everyday life. The results suggest that many subjects exposed to lead suffered acute encephalopathy in childhood which resolved into a chronic subclinical encephalopathy with associated cognitive dysfunction still evident in adulthood. These findings lend support to efforts to limit exposure to lead in childhood
Changes in dental plaque following hospitalisation in a critical care unit: an observational study
Additional funding was provided by a grant
from the Faculty of Dental Surgery, Royal College of Surgeons, England, and
this work was undertaken at University College London/University College
London Hospitals, which received a proportion of funding from the
Department of Health’s National Institute for Health Research Biomedical
Research Centres funding scheme
An evolutionary technique to approximate multiple optimal alignments
The alignment of observed and modeled behavior is an essential aid for organizations, since it opens the door for root-cause analysis and enhancement of processes. The state-of-the-art technique for computing alignments has exponential time and space complexity, hindering its applicability for medium and large instances. Moreover, the fact that there may be multiple optimal alignments is perceived as a negative situation, while in reality it may provide a more comprehensive picture of the model’s explanation of observed behavior, from which other techniques may benefit. This paper presents a novel evolutionary technique for approximating multiple optimal alignments. Remarkably, the memory footprint of the proposed technique is bounded, representing an unprecedented guarantee with respect to the state-of-the-art methods for the same task. The technique is implemented into a tool, and experiments on several benchmarks are provided.Peer ReviewedPostprint (author's final draft
An optimized TOPS+ comparison method for enhanced TOPS models
This article has been made available through the Brunel Open Access Publishing Fund.Background
Although methods based on highly abstract descriptions of protein structures, such as VAST and TOPS, can perform very fast protein structure comparison, the results can lack a high degree of biological significance. Previously we have discussed the basic mechanisms of our novel method for structure comparison based on our TOPS+ model (Topological descriptions of Protein Structures Enhanced with Ligand Information). In this paper we show how these results can be significantly improved using parameter optimization, and we call the resulting optimised TOPS+ method as advanced TOPS+ comparison method i.e. advTOPS+.
Results
We have developed a TOPS+ string model as an improvement to the TOPS [1-3] graph model by considering loops as secondary structure elements (SSEs) in addition to helices and strands, representing ligands as first class objects, and describing interactions between SSEs, and SSEs and ligands, by incoming and outgoing arcs, annotating SSEs with the interaction direction and type. Benchmarking results of an all-against-all pairwise comparison using a large dataset of 2,620 non-redundant structures from the PDB40 dataset [4] demonstrate the biological significance, in terms of SCOP classification at the superfamily level, of our TOPS+ comparison method.
Conclusions
Our advanced TOPS+ comparison shows better performance on the PDB40 dataset [4] compared to our basic TOPS+ method, giving 90 percent accuracy for SCOP alpha+beta; a 6 percent increase in accuracy compared to the TOPS and basic TOPS+ methods. It also outperforms the TOPS, basic TOPS+ and SSAP comparison methods on the Chew-Kedem dataset [5], achieving 98 percent accuracy. Software Availability: The TOPS+ comparison server is available at http://balabio.dcs.gla.ac.uk/mallika/WebTOPS/.This article is available through the Brunel Open Access Publishing Fun
- …
