Polyphonic music transcription using note onset and offset detection
In this paper, an approach for polyphonic music transcription based on joint multiple-F0 estimation and note onset/offset detection is proposed. For preprocessing, the resonator time-frequency image of the input music signal is extracted and noise suppression is performed. A pitch salience function is extracted for each frame along with tuning and inharmonicity parameters. For onset detection, late fusion is employed by combining a novel spectral flux-based feature which incorporates pitch tuning information and a novel salience function-based descriptor. For each segment defined by two onsets, an overlapping partial treatment procedure is used and a pitch set score function is proposed. A note offset detection procedure is also proposed using HMMs trained on MIDI data. The system was trained on piano chords and tested on classical and jazz recordings from the RWC database. Improved transcription results are reported compared to state-of-the-art approaches.
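As a rough illustration of the spectral flux idea mentioned in the abstract above, the following sketch computes a plain half-wave-rectified spectral flux from a sequence of magnitude spectra. It does not include the pitch tuning information or the salience-based descriptor that the paper proposes; the function name `spectral_flux` and the input layout are assumptions made for illustration only.

```python
from typing import List, Sequence


def spectral_flux(frames: Sequence[Sequence[float]]) -> List[float]:
    """Half-wave-rectified spectral flux between consecutive magnitude frames.

    `frames` is a hypothetical input: one list of non-negative magnitude
    values per analysis frame, all of the same length.
    """
    if not frames:
        return []
    flux = [0.0]  # frame 0 has no previous frame to compare against
    for prev, curr in zip(frames, frames[1:]):
        # Sum only the increases in magnitude; decreases are ignored.
        flux.append(sum(max(c - p, 0.0) for p, c in zip(prev, curr)))
    return flux
```

Peaks in the returned curve are candidate onsets; how they are thresholded and fused with other features is left open here.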
Including patient choice in cost-effectiveness decision rules
There has been increasing discussion in the economic literature about the appropriateness of using general population values within technology appraisal. This paper proposes an alternative approach to incorporating patient values into the cost-effectiveness decision rule that lies at the heart of funding decisions. Whilst the current decision rule is constructed around a technical question, namely, "which treatment is the most cost-effective?", the key policy question is "which treatments should be offered to the patient?". A two-part decision rule is explored which gives the patient the choice of the most cost-effective treatment plus all cheaper options. Whilst the adoption of this patient-based cost-effectiveness rule may not alter many decisions compared to the current approach, it would represent a profound shift in the way that patient values and patient choice are incorporated into economic evaluation.
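The two-part rule described in the abstract above can be sketched as follows. The abstract does not say how "most cost-effective" is determined, so this sketch assumes it means the highest net monetary benefit at a given willingness-to-pay threshold. The function name, the data layout, and the example figures are hypothetical.

```python
from typing import Dict, List, Tuple


def offered_treatments(
    treatments: Dict[str, Tuple[float, float]], threshold: float
) -> List[str]:
    """Return the most cost-effective treatment plus every cheaper option.

    `treatments` maps a name to (cost, effect). "Most cost-effective" is
    taken here to mean the highest net monetary benefit,
    threshold * effect - cost, which is an assumption of this sketch.
    """
    best = max(
        treatments,
        key=lambda name: threshold * treatments[name][1] - treatments[name][0],
    )
    best_cost = treatments[best][0]
    # Second part of the rule: all options cheaper than the best one.
    cheaper = [n for n, (cost, _) in treatments.items() if cost < best_cost]
    return [best] + sorted(cheaper)
```

For example, with illustrative options `{"A": (1000, 0.5), "B": (5000, 0.8), "C": (20000, 0.9)}` and a threshold of 20000 per unit of effect, the rule offers B (the highest net benefit) together with the cheaper option A, but not C.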
A temporally-constrained convolutive probabilistic model for pitch detection
A method for pitch detection which models the temporal evolution of musical sounds is presented in this paper. The proposed model is based on shift-invariant probabilistic latent component analysis, constrained by a hidden Markov model. The time-frequency representation of a produced musical note can be expressed by the model as a temporal sequence of spectral templates which can also be shifted over log-frequency. Thus, this approach can be effectively used for pitch detection in music signals that contain amplitude and frequency modulations. Experiments were performed using extracted sequences of spectral templates on monophonic music excerpts, where the proposed model outperforms a non-temporally constrained convolutive model for pitch detection. Finally, future directions are given for multipitch extensions of the proposed model.
Multiple-instrument polyphonic music transcription using a convolutive probabilistic model
(Abstract to follow)
Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution
This paper proposes a system for multiple fundamental frequency estimation of piano sounds using pitch candidate selection rules which employ spectral structure and temporal evolution. As a time-frequency representation, the Resonator Time-Frequency Image of the input signal is employed, a noise suppression model is used, and a spectral whitening procedure is performed. In addition, a spectral flux-based onset detector is employed in order to select the steady-state region of the produced sound. In the multiple-F0 estimation stage, tuning and inharmonicity parameters are extracted and a pitch salience function is proposed. Pitch presence tests are performed utilizing information from the spectral structure of pitch candidates, aiming to suppress errors occurring at multiples and sub-multiples of the true pitches. A novel feature for the estimation of harmonically related pitches is proposed, based on the common amplitude modulation assumption. Experiments are performed on the MAPS database using 8784 piano samples of classical, jazz, and random chords with polyphony levels between 1 and 6. The proposed system is computationally inexpensive, being able to perform multiple-F0 estimation experiments in real time. Experimental results indicate that the proposed system outperforms state-of-the-art approaches for the aforementioned task in a statistically significant manner. Index Terms: multiple-F0 estimation, resonator time-frequency image, common amplitude modulation.
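To give a sense of what an inharmonicity parameter describes, the sketch below uses the standard stiff-string model, in which the k-th partial lies at k·f0·sqrt(1 + B·k²). The abstract above does not state which model the paper uses, so this formula is an assumption of the example, as is the function name `partial_frequency`.

```python
import math


def partial_frequency(f0: float, k: int, inharmonicity: float) -> float:
    """Frequency of the k-th partial under the stiff-string model.

    f_k = k * f0 * sqrt(1 + B * k^2), where B is the inharmonicity
    coefficient. With B = 0 the partials are exact harmonics of f0.
    """
    return k * f0 * math.sqrt(1.0 + inharmonicity * k * k)
```

With a positive coefficient, upper partials are progressively sharper than integer multiples of the fundamental, which is why a pitch salience function for piano sounds benefits from estimating it.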
pYIN: A fundamental frequency estimator using probabilistic threshold distributions
© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Higher Spin BRS Cohomology of Supersymmetric Chiral Matter in D=4
We examine the BRS cohomology of chiral matter in $N=1$, $D=4$ supersymmetry to determine a general form of composite superfield operators which can suffer from supersymmetry anomalies. Composite superfield operators $\Psi_{(a,b)}$ are products of the elementary chiral superfields $S$ and $\overline{S}$ and the derivative operators $D_\alpha$, $\overline{D}_{\dot\beta}$ and $\partial_{\alpha\dot\beta}$. Such superfields $\Psi_{(a,b)}$ can be chosen to have $a$ symmetrized undotted indices $\alpha_i$ and $b$ symmetrized dotted indices $\dot\beta_j$. The result derived here is that each composite superfield $\Psi_{(a,b)}$ is subject to potential supersymmetry anomalies if $a-b$ is an odd number, which means that $\Psi_{(a,b)}$ is a fermionic superfield. Comment: 15 pages, CPT-TAMU-20/9
Eddy current generation enhancement using ferrite for electromagnetic acoustic transduction
Eddy currents are generated in an electrically conducting surface as a step in electromagnetic acoustic transduction (EAT). In eddy current testing, wire coils are often wound onto a ferrite core to increase the generated eddy current. With EAT, increased coil inductance is unacceptable as it leads to a reduction in the amplitude of a given frequency of eddy current from a limited voltage source, particularly where the current arises from capacitor discharge. The authors present a method for EAT where ferrite is used to increase the eddy current amplitude without significantly increasing coil inductance or changing the frequency content of the eddy current.
AIDS in Botswana: Evaluating the general equilibrium implications of healthcare interventions
This paper reports an analysis of the effects of health care interventions designed to reduce the impacts of the HIV/AIDS epidemic on the Botswana economy. The analyses were conducted using a recursive dynamic computable general equilibrium model for Botswana within which was embedded a compartmental epidemiological model. The health care interventions examined are reductions in other sexually transmitted diseases (STDs) that reduce the probability of HIV transmission and a mass media health education programme that reduces the number of new sexual partnerships being formed. While the policy scenarios examined are, necessarily, somewhat stylised, the results indicate both the devastating adverse effects of the epidemic and the substantial potential benefits of the interventions. Without interventions disposable household incomes per capita are up to 50 per cent less than they would have been in 2020, but with these interventions the adverse effects of the epidemic are more than halved.
The Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth
The tuning system of a keyboard instrument is chosen so that frequently used musical intervals sound as consonant as possible. Temperament refers to the compromise arising from the fact that not all intervals can be maximally consonant simultaneously. Recent work showed that it is possible to estimate temperament from audio recordings with no prior knowledge of the musical score, using a conservative (high precision, low recall) automatic transcription algorithm followed by frequency estimation using quadratic interpolation and bias correction from the log magnitude spectrum. In this paper we develop a harpsichord-specific transcription system to analyse over 500 recordings of solo harpsichord music for which the temperament is specified on the CD sleeve notes. We compare the measured temperaments with the annotations and discuss the differences between temperament as a theoretical construct and as a practical issue for professional performers and tuners. The implications are that ground truth is not always scientific truth, and that content-based analysis has an important role in the study of historical performance practice.
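The quadratic interpolation step mentioned in the abstract above can be illustrated with the common three-point parabolic fit on log magnitudes. This is a generic sketch, not the authors' implementation: the bias correction they describe is omitted, and the function name and parameters are assumptions.

```python
import math
from typing import Sequence


def interpolated_peak_frequency(
    magnitudes: Sequence[float], peak_bin: int, sample_rate: float, fft_size: int
) -> float:
    """Refine a spectral peak frequency by fitting a parabola to log magnitudes.

    `magnitudes` holds positive spectral magnitudes; `peak_bin` must be a
    local maximum with a neighbour on each side.
    """
    a = math.log(magnitudes[peak_bin - 1])
    b = math.log(magnitudes[peak_bin])
    c = math.log(magnitudes[peak_bin + 1])
    denom = a - 2.0 * b + c
    # Vertex of the parabola through the three points, in fractional bins.
    offset = 0.0 if denom == 0.0 else 0.5 * (a - c) / denom
    return (peak_bin + offset) * sample_rate / fft_size
```

The fractional offset lies between -0.5 and 0.5 bins when `peak_bin` is a true local maximum, so the estimate resolves frequency more finely than the bin spacing `sample_rate / fft_size`.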
