29,911 research outputs found
Bayesian Speaker Adaptation Based on a New Hierarchical Probabilistic Model
In this paper, a new hierarchical Bayesian speaker adaptation method called HMAP is proposed that combines the advantages of three conventional algorithms, maximum a posteriori (MAP), maximum-likelihood linear regression (MLLR), and eigenvoice, resulting in excellent performance across a wide range of adaptation conditions. The new method efficiently utilizes intra-speaker and inter-speaker correlation information through modeling phone and speaker subspaces in a consistent hierarchical Bayesian way. The phone variations for a specific speaker are assumed to be located in a low-dimensional subspace. The phone coordinate, which is shared among different speakers, implicitly contains the intra-speaker correlation information. For a specific speaker, the phone variation, represented by speaker-dependent eigenphones, are concatenated into a supervector. The eigenphone supervector space is also a low dimensional speaker subspace, which contains inter-speaker correlation information. Using principal component analysis (PCA), a new hierarchical probabilistic model for the generation of the speech observations is obtained. Speaker adaptation based on the new hierarchical model is derived using the maximum a posteriori criterion in a top-down manner. Both batch adaptation and online adaptation schemes are proposed. With tuned parameters, the new method can handle varying amounts of adaptation data automatically and efficiently. Experimental results on a Mandarin Chinese continuous speech recognition task show good performance under all testing conditions
ImageSpirit: Verbal Guided Image Parsing
Humans describe images in terms of nouns and adjectives while algorithms
operate on images represented as sets of pixels. Bridging this gap between how
humans would like to access images versus their typical representation is the
goal of image parsing, which involves assigning object and attribute labels to
pixel. In this paper we propose treating nouns as object labels and adjectives
as visual attribute labels. This allows us to formulate the image parsing
problem as one of jointly estimating per-pixel object and attribute labels from
a set of training images. We propose an efficient (interactive time) solution.
Using the extracted labels as handles, our system empowers a user to verbally
refine the results. This enables hands-free parsing of an image into pixel-wise
object/attribute labels that correspond to human semantics. Verbally selecting
objects of interests enables a novel and natural interaction modality that can
possibly be used to interact with new generation devices (e.g. smart phones,
Google Glass, living room devices). We demonstrate our system on a large number
of real-world images with varying complexity. To help understand the tradeoffs
compared to traditional mouse based interactions, results are reported for both
a large scale quantitative evaluation and a user study.Comment: http://mmcheng.net/imagespirit
Efficacy of Ultrasound-guided Radiofrequency Ablation of Parathyroid Hyperplasia: Single Session vs. Two-Session for Effect on Hypocalcemia
To evaluate safety and efficacy of one- vs. two-session radiofrequency ablation (RFA) of parathyroid hyperplasia for patients with secondary hyperparathyroidism (SHPT) and to compare the outcome of both methods on hypocalcemia. Patients with secondary hyperparathyroidism underwent ultrasound guided RFA of parathyroid hyperplasia. Patients were alternately assigned to either group 1 (n = 28) with RFA of all 4 glands in one session or group 2 (n = 28) with RFA of 2 glands in a first session and other 2 glands in a second session. Serum parathyroid hormone (PTH), calcium, phosphorus and alkaline phosphatase (ALP) values were measured at a series of time points after RFA. RFA parameters, including operation duration and ablation time and hospitalization length and cost, were compared between the two groups. Mean PTH decreased in group 1 from 1865.18 ± 828.93 pg/ml to 145.72 ± 119.27 pg/ml at 1 day after RFA and in group 2 from 2256.64 ± 1021.72 pg/ml to 1388.13 ± 890.15 pg/ml at 1 day after first RFA and to 137.26 ± 107.12 pg/ml at 1 day after second RFA. Group 1\u27s calcium level decreased to 1.79 ± 0.31 mmol/L at day 1 after RFA and group 2 decreased to 1.89 ± 0.26 mmol/L at day 1 after second session RFA (P \u3c 0.05). Multivariate analysis showed that hypocalcemia was related to serum ALP. Patients with ALP ≥ 566 U/L had lower calcium compared to patients with ALP \u3c 566 U/L up to a month after RFA (P \u3c 0.05). Group 1\u27s RFA time and hospitalization were shorter and had lower cost compared with Group 2. US-guided RFA of parathyroid hyperplasia is a safe and effective method for treating secondary hyperparathyroidism. Single-session RFA was more cost-effective and resulted in a shorter hospital stay compared to two sessions. However, patients with two-session RFA had less hypocalcemia, especially those with high ALP
- …
