340 research outputs found
RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale
The instruction-following ability of Large Language Models (LLMs) has
cultivated a class of LLM-based systems capable of approaching complex tasks
such as making edits to large code repositories. Due to the high sensitivity
and unpredictability of LLM behavior in response to changes in prompting,
robust evaluation tools are needed to drive future iteration of these systems.
We propose RES-Q, a natural language instruction-based benchmark for evaluating
Repository Editing Systems, which consists of
100 handcrafted repository editing tasks derived from real GitHub commits.
Given an edit instruction and a code repository, RES-Q evaluates an LLM
system's ability to interpret the instruction, navigate the repository to
gather relevant information, and construct an appropriate edit that satisfies
the specified criteria. We argue that evaluating LLMs in this way addresses
issues with traditional benchmarks and provides a more holistic assessment of a
model's abilities. We evaluate various state-of-the-art LLMs as language agents
in a repository-editing system built on Qurrent OS, our language agent
development software. Despite their 1% pass@1 performance difference on
HumanEval, we find Claude Sonnet 3.5 outperforms GPT-4o by 12% pass@1 on RES-Q,
indicating RES-Q's capacity to differentiate model capability as traditional
benchmarks approach saturation. We further investigate token efficiency,
performance relationships with existing benchmarks, and interesting disparities
between closed and open-source LLMs. Code and dataset are available at
https://github.com/Qurrent-AI/RES-Q
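The pass@1 figures quoted above can be read as the fraction of benchmark tasks solved on a single attempt. The sketch below is illustrative only: it uses the widely used unbiased pass@k estimator (n samples per task, c correct), which the abstract does not confirm as the authors' exact procedure, and the task outcomes are made-up numbers chosen only to show what a 12-point gap means on a 100-task benchmark.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one task: n samples drawn, c of them correct."""
    if n - c < k:
        # Every size-k subset of the samples contains at least one correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_1(results: list[bool]) -> float:
    """Mean pass@1 over tasks when each task gets exactly one attempt."""
    return sum(results) / len(results)


# Hypothetical outcomes on a 100-task benchmark (NOT results from the paper).
system_a = [True] * 50 + [False] * 50
system_b = [True] * 38 + [False] * 62

gap = benchmark_pass_at_1(system_a) - benchmark_pass_at_1(system_b)
print(f"pass@1 gap: {gap:.2f}")
```

With 100 tasks, each solved task moves pass@1 by one percentage point, so a 12% gap corresponds to 12 more tasks solved.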
High-Voltage Engineering Laboratory (S 8855): Theory and Experiment Instructions
Hello!
These high-voltage lecture notes are meant to prepare you for the experiments in the High-Voltage Engineering Laboratory. Alongside the fundamentals of high-voltage engineering, this electrifying lab course focuses in particular on scientific practice in planning, carrying out, and documenting experiments. And of course, crackling, flashovers, and lightning bolts must not be missing!
The notes are divided into four parts, each containing the learning objectives, theory, and experiment description for one of the four sub-experiments. The theory summarizes the most important aspects of high-voltage engineering for use in the experiments. Further information can be found in the references.
If you have suggestions for improvement or change requests for these notes, or if you notice any errors, please send us an email or point them out to us on the lab days.
Wishing you lots of fun reading, learning, and working in the lab,
the IEE high-voltage team
Developing a Framework for Population Health in Interprofessional Training: An Interprofessional Education Module
Interprofessional education (IPE) is based on the concept that health professional students are best trained on the skills, knowledge, and attitudes that promote population health when they learn with and about others from diverse health science fields. Previously, IPE has focused almost exclusively on the clinical context. This study piloted and evaluated an IPE learning experience that emphasizes population health in a sample of public health undergraduate students. We hypothesized that students who completed the 2-hour online asynchronous module would better understand the value of public health's role in interprofessional teams, the benefit of interprofessional teamwork in improving health outcomes, and the value of collaborative learning with other interprofessional students. Students engaged in pre- and post-training assessments and individual reflections throughout the module. Sixty-seven undergraduate public health students completed the module and assessments. After completion, a greater proportion strongly agreed that students from different health science disciplines should be educated in the same setting to form collaborative relationships with one another (19 vs. 39% before and after completion, respectively). A greater proportion also strongly agreed that care delivered by an interprofessional team would benefit the health outcomes of a patient/client after the training (60 vs. 75% before and after, respectively). Mean scores describing how strongly students agreed with the above two statements significantly increased post-training. A greater proportion of students strongly agreed that incorporating the public health discipline as part of an interprofessional team is crucial to address the social determinants of health for individual health outcomes after taking the training (40 vs. 55% before and after, respectively). 
There was little change in attitudes about the importance of incorporating public health as part of an interprofessional team to address social determinants of health for population health outcomes, which were strongly positive before the training. Most students reported being satisfied with the module presentation and felt their understanding of interprofessional practice improved. This training may be useful for students from all health disciplines to recognize the benefits of engaging with and learning from public health students and to recognize the important role of public health in interprofessional practices.
The Simple View of Reading Made Complex by Morphological Decoding Fluency in Bilingual Fourth-Grade Readers of English
This is the author accepted manuscript. The final version is available from Wiley via the DOI in this record. This study examined the complexity of the Simple View of Reading focusing on morphological
decoding fluency in fourth-grade readers of English in Singapore. The participants were three
groups of students who all learned to become bilingual and biliterate in the English language
(EL) and their respective ethnic language in school but differed in the home language they used.
The first group was ethnic Chinese students who used English as the dominant home language
(Chinese EL1); the other two groups were ethnic Chinese and Malay students whose dominant
home language was not English but Chinese (Chinese EL2) and Malay (Malay EL2),
respectively. The measures included pseudo word decoding (phonemic decoding), timed
decoding of derivational words (morphological decoding fluency), oral vocabulary, and passage
comprehension. Path analysis showed that oral vocabulary significantly predicted reading
comprehension across all three groups; yet a significant effect of morphological decoding
fluency surfaced in the Chinese EL1 and Malay EL2 groups but not the Chinese EL2 group.
Multi-group path analysis and commonality analysis further confirmed that morphological
decoding played a larger role in the Chinese EL1 and Malay EL2 groups. These findings
are discussed in light of the joint influence of target language experience and cross-linguistic
influence on second language or bilingual reading development. Office of Education Research, National Institute of Education, Nanyang Technological University
A Europe-wide inventory of citizen-led energy action with data from 29 countries and over 10000 initiatives
Emilin1 gene and essential hypertension: a two-stage association study in northern Han Chinese population
Background: Elastogenesis of the elastic extracellular matrix (ECM), a major component of blood vessels, was long believed to play only a passive role in the dynamic vascular changes of typical hypertension. The Emilin1 gene participates in the transcription underlying ECM formation and was shown in an animal study to link TGF-β maturation to blood pressure homeostasis. Recent advances call for further research into the role of the Emilin1 gene in regulating the TGF-β signals involved in elastogenesis and the vascular cell defects of essential hypertension (EH).
Methods: We designed a two-stage case-control study and selected three single nucleotide polymorphisms (SNPs), rs3754734, rs2011616 and rs2304682, from the HapMap database, which covered the Emilin1 gene. In total, 2,586 subjects were recruited from the International Collaborative Study of Cardiovascular Disease in Asia (InterASIA). In stage 1, all three SNPs of the Emilin1 gene were genotyped and tested within a subsample of 503 cases and 490 controls. Significant SNPs then entered stage 2, which included 814 cases with hypertension and 779 controls, and were analyzed on the basis of all 2,586 subjects.
Results: In stage 1, single-locus analyses showed that SNPs rs3754734 and rs2011616 were significantly associated with EH (P < 0.05). In stage 2, weak associations under the dominant model were observed after age stratification: the odds ratios (ORs) of TG+GG vs. TT for rs3754734 were 0.768 (0.584-1.009), 0.985 (0.735-1.320) and 1.346 (1.003-1.806) in the < 50, 50-59 and ≥ 60 years groups, and the ORs of GA+AA vs. GG for rs2011616 were 0.745 (0.568-0.977), 1.013 (0.758-1.353) and 1.437 (1.072-1.926) in the same groups, respectively. Accordingly, significant interactions with age were detected for the genotypes of rs3754734 and rs2011616 in EH, with ORs of 1.758 (1.180-2.620), P = 0.006, and 1.903 (1.281-2.825), P = 0.001, respectively. Haplotype analysis showed that no haplotype was directly associated with EH, but the interaction of hap2 (GA) with age group was significant after adjustment for covariates (OR 1.220 (1.031-1.444), P = 0.020).
Conclusion: Our findings do not support a positive association of the Emilin1 gene with EH, but the interaction of age with genotype variation at rs3754734 and rs2011616 might increase the risk of hypertension.
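For readers less familiar with the "OR (lower-upper)" notation used in this abstract, the sketch below shows how an unadjusted odds ratio and its 95% confidence interval can be computed from a 2x2 case-control table using the standard log (Woolf) method. This is an assumption-laden illustration, not the authors' analysis: the abstract mentions covariate adjustment, which would require a regression model, and the counts below are hypothetical.

```python
from math import exp, log, sqrt


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval.

    a: cases carrying the genotype      b: cases not carrying it
    c: controls carrying the genotype   d: controls not carrying it
    z: normal quantile (1.96 gives a 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table.
    se = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * se)
    upper = exp(log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only (not taken from the study).
or_, lo, hi = odds_ratio_ci(a=120, b=80, c=100, d=100)
print(f"OR = {or_:.3f} ({lo:.3f}-{hi:.3f})")
```

An interval that excludes 1, such as 1.346 (1.003-1.806) above, corresponds to significance at the 5% level, whereas one that spans 1, such as 0.985 (0.735-1.320), does not.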
The James Webb Space Telescope Mission
Twenty-six years ago a small committee report, building on earlier studies,
expounded a compelling and poetic vision for the future of astronomy, calling
for an infrared-optimized space telescope with an aperture of at least 4 m.
With the support of their governments in the US, Europe, and Canada, 20,000
people realized that vision as the James Webb Space Telescope. A
generation of astronomers will celebrate their accomplishments for the life of
the mission, potentially as long as 20 years, and beyond. This report and the
scientific discoveries that follow are extended thank-you notes to the 20,000
team members. The telescope is working perfectly, with much better image
quality than expected. In this and accompanying papers, we give a brief
history, describe the observatory, outline its objectives and current observing
program, and discuss the inventions and people who made it possible. We cite
detailed reports on the design and the measured performance on orbit. Comment: Accepted by PASP for the special issue on The James Webb Space
Telescope Overview, 29 pages, 4 figures
Improving the assessment and management of obesity in UK children and adolescents: the PROMISE research programme including a RCT
Background: Five linked studies were undertaken to inform identified evidence gaps in the childhood obesity pathway.
Objectives: (1) To scope the impact of the National Child Measurement Programme (NCMP) (study A). (2) To develop a brief evidence-based electronic assessment and management tool (study B). (3) To develop evidence-based algorithms for identifying the risk of obesity comorbidities (study B). (4) To conduct an efficacy trial of the Healthy Eating and Lifestyle Programme (HELP) (study C). (5) To improve the prescribing of anti-obesity drugs in UK adolescents (study D). (6) To investigate the safety, outcomes and predictors of outcome of adolescent bariatric surgery in the UK (study E).
Methods: Five substudies – (1) a parental survey before and after feedback from the National Child Measurement Programme, (2) risk algorithm development and piloting of a new primary care management tool, (3) a randomised controlled trial of the Healthy Eating and Lifestyle Programme, (4) quantitative and qualitative studies of anti-obesity drug treatment in adolescents and (5) a prospective clinical audit and cost-effectiveness evaluation of adolescent bariatric surgery in one centre.
Results: Study A – before the National Child Measurement Programme feedback, three-quarters of parents of overweight and obese children did not recognise their child to be overweight. Eighty-seven per cent of parents found the National Child Measurement Programme feedback to be helpful. Feedback had positive effects on parental knowledge, perceptions and intentions. Study B – risk estimation models for cardiovascular and psychosocial comorbidities of obesity require further development. An online consultation tool for primary care practitioners is acceptable and feasible. Study C – the Healthy Eating and Lifestyle Programme, when delivered in the community by graduate mental health workers, showed no significant effect on body mass index at 6 months (primary outcome) when compared with enhanced usual care. Study D – anti-obesity drugs appear efficacious in meta-analysis, and their use has expanded rapidly in the last decade. However, the majority of prescriptions are rapidly discontinued after 1–3 months of treatment. Few young people described positive experiences of anti-obesity drugs. Prescribing was rarely compliant with the National Institute for Health and Care Excellence guidance. Study E – bariatric surgery appears safe, effective and highly cost-effective in adolescents in the NHS.
Future work and limitations: Work is needed to evaluate behaviour and body mass index change in the National Child Measurement Programme more accurately and improve primary care professionals’ understanding of the National Child Measurement Programme feedback, update and further evaluate the Computer-Assisted Treatment of CHildren (CATCH) tool, investigate delivery of weight management interventions to young people from deprived backgrounds and those with significant psychological distress and obtain longer-term data on anti-obesity drug use and bariatric surgery outcomes in adolescence.
Trial registration: Current Controlled Trials ISRCTN99840111.
Funding: This project was funded by the National Institute for Health Research (NIHR) Programme Grants for Applied Research programme and will be published in full in Programme Grants for Applied Research; Vol. 8, No. 3. See the NIHR Journals Library website for further project information.
Ernst II, Duke of Saxe-Gotha and Altenburg, as Patron and Protector of Science and Art
by Dr. August Beck
