815 research outputs found
Generalized Permutohedra from Probabilistic Graphical Models
A graphical model encodes conditional independence relations via the Markov
properties. For an undirected graph these conditional independence relations
can be represented by a simple polytope known as the graph associahedron, which
can be constructed as a Minkowski sum of standard simplices. There is an
analogous polytope for conditional independence relations coming from a regular
Gaussian model, and it can be defined using multiinformation or relative
entropy. For directed acyclic graphical models and also for mixed graphical
models containing undirected, directed and bidirected edges, we give a
construction of this polytope, up to equivalence of normal fans, as a Minkowski
sum of matroid polytopes. Finally, we apply this geometric insight to construct
a new ordering-based search algorithm for causal inference via directed acyclic
graphical models.Comment: Appendix B is expanded. Final version to appear in SIAM J. Discrete
Mat
‘‘There’s so much more to it than what I initially thought’’: Stepping into researchers’ shoes with a class activity in a first year psychology survey course
In psychology, it is widely agreed that research methods, although central to the discipline, are particularly challenging to learn and teach, particularly at introductory level. This pilot study explored the potential of embedding a student-conducted research activity in a one-semester undergraduate Introduction to Psychology survey course, with the aims of (a) engaging students with the topic of research methods; (b) developing students’ comprehension and application of research methods concepts; and (c) building students’ ability to link research with theory. The research activity explored shoe ownership, examining gender differences and relationships with age, and linking to theories of gender difference and of consumer identity. The process of carrying out the research and reflecting on it created a contextualized, active learning environment in which students themselves raised many issues that research methods lectures seek to cover. Students also wrote richer assignments than standard first year mid-term essay
The IBMAP approach for Markov networks structure learning
In this work we consider the problem of learning the structure of Markov
networks from data. We present an approach for tackling this problem called
IBMAP, together with an efficient instantiation of the approach: the IBMAP-HC
algorithm, designed for avoiding important limitations of existing
independence-based algorithms. These algorithms proceed by performing
statistical independence tests on data, trusting completely the outcome of each
test. In practice tests may be incorrect, resulting in potential cascading
errors and the consequent reduction in the quality of the structures learned.
IBMAP contemplates this uncertainty in the outcome of the tests through a
probabilistic maximum-a-posteriori approach. The approach is instantiated in
the IBMAP-HC algorithm, a structure selection strategy that performs a
polynomial heuristic local search in the space of possible structures. We
present an extensive empirical evaluation on synthetic and real data, showing
that our algorithm outperforms significantly the current independence-based
algorithms, in terms of data efficiency and quality of learned structures, with
equivalent computational complexities. We also show the performance of IBMAP-HC
in a real-world application of knowledge discovery: EDAs, which are
evolutionary algorithms that use structure learning on each generation for
modeling the distribution of populations. The experiments show that when
IBMAP-HC is used to learn the structure, EDAs improve the convergence to the
optimum
Current-induced cooling phenomenon in a two-dimensional electron gas under a magnetic field
We investigate the spatial distribution of temperature induced by a dc
current in a two-dimensional electron gas (2DEG) subjected to a perpendicular
magnetic field. We numerically calculate the distributions of the electrostatic
potential phi and the temperature T in a 2DEG enclosed in a square area
surrounded by insulated-adiabatic (top and bottom) and isopotential-isothermal
(left and right) boundaries (with phi_{left} < phi_{right} and T_{left}
=T_{right}), using a pair of nonlinear Poisson equations (for phi and T) that
fully take into account thermoelectric and thermomagnetic phenomena, including
the Hall, Nernst, Ettingshausen, and Righi-Leduc effects. We find that, in the
vicinity of the left-bottom corner, the temperature becomes lower than the
fixed boundary temperature, contrary to the naive expectation that the
temperature is raised by the prevalent Joule heating effect. The cooling is
attributed to the Ettingshausen effect at the bottom adiabatic boundary, which
pumps up the heat away from the bottom boundary. In order to keep the adiabatic
condition, downward temperature gradient, hence the cooled area, is developed
near the boundary, with the resulting thermal diffusion compensating the upward
heat current due to the Ettingshausen effect.Comment: 25 pages, 7 figure
The identification of informative genes from multiple datasets with increasing complexity
Background
In microarray data analysis, factors such as data quality, biological variation, and the increasingly multi-layered nature of more complex biological systems complicates the modelling of regulatory networks that can represent and capture the interactions among genes. We believe that the use of multiple datasets derived from related biological systems leads to more robust models. Therefore, we developed a novel framework for modelling regulatory networks that involves training and evaluation on independent datasets. Our approach includes the following steps: (1) ordering the datasets based on their level of noise and informativeness; (2) selection of a Bayesian classifier with an appropriate level of complexity by evaluation of predictive performance on independent data sets; (3) comparing the different gene selections and the influence of increasing the model complexity; (4) functional analysis of the informative genes.
Results
In this paper, we identify the most appropriate model complexity using cross-validation and independent test set validation for predicting gene expression in three published datasets related to myogenesis and muscle differentiation. Furthermore, we demonstrate that models trained on simpler datasets can be used to identify interactions among genes and select the most informative. We also show that these models can explain the myogenesis-related genes (genes of interest) significantly better than others (P < 0.004) since the improvement in their rankings is much more pronounced. Finally, after further evaluating our results on synthetic datasets, we show that our approach outperforms a concordance method by Lai et al. in identifying informative genes from multiple datasets with increasing complexity whilst additionally modelling the interaction between genes.
Conclusions
We show that Bayesian networks derived from simpler controlled systems have better performance than those trained on datasets from more complex biological systems. Further, we present that highly predictive and consistent genes, from the pool of differentially expressed genes, across independent datasets are more likely to be fundamentally involved in the biological process under study. We conclude that networks trained on simpler controlled systems, such as in vitro experiments, can be used to model and capture interactions among genes in more complex datasets, such as in vivo experiments, where these interactions would otherwise be concealed by a multitude of other ongoing events
A review on probabilistic graphical models in evolutionary computation
Thanks to their inherent properties, probabilistic graphical models are one of the prime candidates for machine learning and decision making tasks especially in uncertain domains. Their capabilities, like representation, inference and learning, if used effectively, can greatly help to build intelligent systems that are able to act accordingly in different problem domains. Evolutionary algorithms is one such discipline that has employed probabilistic graphical models to improve the search for optimal solutions in complex problems. This paper shows how probabilistic graphical models have been used in evolutionary algorithms to improve their performance in solving complex problems. Specifically, we give a survey of probabilistic model building-based evolutionary algorithms, called estimation of distribution algorithms, and compare different methods for probabilistic modeling in these algorithms
The Use of Case Study Competitions to Prepare Students for the World of Work
As we continue into the new millennium, it is imperative that educational institutions equip graduates with the knowledge and skills that are increasingly needed and valued by business and industry. In this article, the authors argue that the case study approach and, specifically, case study competitions constitute an ideal pedagogical strategy for achieving this objective in an effective and efficient manner, with resulting benefits for both students and employers
Learning from peer feedback on student-generated multiple choice questions: Views of introductory physics students
PeerWise is an online application where students are encouraged to generate a bank of multiple choice questions for their classmates to answer. After answering a question, students can provide feedback to the question author about the quality of the question and the question author can respond to this. Student use of, and attitudes to, this online community within PeerWise was investigated in two large first year undergraduate physics courses, across three academic years, to explore how students interact with the system and the extent to which they believe PeerWise to be useful to their learning. Most students recognized that there is value in engaging with PeerWise, and many students engaged deeply with the system, thinking critically about the quality of their submissions and reflecting on feedback provided to them. Students also valued the breadth of topics and level of difficulty offered by the questions, recognized the revision benefits afforded by the resource, and were often willing to contribute to the community by providing additional explanations and engaging in discussion
A genetic algorithm-Bayesian network approach for the analysis of metabolomics and spectroscopic data: application to the rapid detection of Bacillus spores and identification of Bacillus species
Background
The rapid identification of Bacillus spores and bacterial identification are paramount because of their implications in food poisoning, pathogenesis and their use as potential biowarfare agents. Many automated analytical techniques such as Curie-point pyrolysis mass spectrometry (Py-MS) have been used to identify bacterial spores giving use to large amounts of analytical data. This high number of features makes interpretation of the data extremely difficult We analysed Py-MS data from 36 different strains of aerobic endospore-forming bacteria encompassing seven different species. These bacteria were grown axenically on nutrient agar and vegetative biomass and spores were analyzed by Curie-point Py-MS.
Results
We develop a novel genetic algorithm-Bayesian network algorithm that accurately identifies sand selects a small subset of key relevant mass spectra (biomarkers) to be further analysed. Once identified, this subset of relevant biomarkers was then used to identify Bacillus spores successfully and to identify Bacillus species via a Bayesian network model specifically built for this reduced set of features.
Conclusions
This final compact Bayesian network classification model is parsimonious, computationally fast to run and its graphical visualization allows easy interpretation of the probabilistic relationships among selected biomarkers. In addition, we compare the features selected by the genetic algorithm-Bayesian network approach with the features selected by partial least squares-discriminant analysis (PLS-DA). The classification accuracy results show that the set of features selected by the GA-BN is far superior to PLS-DA
Recommended from our members
Understanding non-governmental organizations in world politics: the promise and pitfalls of the early ‘science of internationalism’
The years immediately preceding the First World War witnessed the development of a significant body of literature claiming to establish a ‘science of internationalism’. This article draws attention to the importance of this literature, especially in relation to understanding the roles of non-governmental organizations in world politics. It elaborates the ways in which this literature sheds light on issues that have become central to twenty-first century debates, including the characteristics, influence, and legitimacy of non-governmental organizations in international relations. Amongst the principal authors discussed in the article are Paul Otlet, Henri La Fontaine and Alfred Fried, whose role in the development of international theory has previously received insufficient attention. The article concludes with evaluation of potential lessons to be drawn from the experience of the early twentieth century ‘science of internationalism’
- …
