536 research outputs found

    VisualMRC: Machine Reading Comprehension on Document Images

    Full text link
    Recent studies on machine reading comprehension have focused on text-level understanding but have not yet reached the level of human understanding of the visual layout and content of real-world documents. In this study, we introduce a new visual machine reading comprehension dataset, named VisualMRC, wherein given a question and a document image, a machine reads and comprehends texts in the image to answer the question in natural language. Compared with existing visual question answering (VQA) datasets that contain texts in images, VisualMRC focuses more on developing natural language understanding and generation abilities. It contains 30,000+ pairs of a question and an abstractive answer for 10,000+ document images sourced from multiple domains of webpages. We also introduce a new model that extends existing sequence-to-sequence models, pre-trained with large-scale text corpora, to take into account the visual layout and content of documents. Experiments with VisualMRC show that this model outperformed the base sequence-to-sequence models and a state-of-the-art VQA model. However, its performance is still below that of humans on most automatic evaluation metrics. The dataset will facilitate research aimed at connecting vision and language understanding.Comment: Accepted as a full paper at AAAI 2021. The first two authors have equal contributio

    Development and Performance of Kyoto's X-ray Astronomical SOI pixel (SOIPIX) sensor

    Full text link
    We have been developing monolithic active pixel sensors, known as Kyoto's X-ray SOIPIXs, based on the CMOS SOI (silicon-on-insulator) technology for next-generation X-ray astronomy satellites. The event trigger output function implemented in each pixel offers microsecond time resolution and enables reduction of the non-X-ray background that dominates the high X-ray energy band above 5--10 keV. A fully depleted SOI with a thick depletion layer and back illumination offers wide band coverage of 0.3--40 keV. Here, we report recent progress in the X-ray SOIPIX development. In this study, we achieved an energy resolution of 300~eV (FWHM) at 6~keV and a read-out noise of 33~e- (rms) in the frame readout mode, which allows us to clearly resolve Mn-Kα\alpha and Kβ\beta. Moreover, we produced a fully depleted layer with a thickness of 500 μm500~{\rm \mu m}. The event-driven readout mode has already been successfully demonstrated.Comment: 7pages, 12figures, SPIE Astronomical Telescopes and Instrumentation 2014, Montreal, Quebec, Canada. appears as Proc. SPIE 9147, Space Telescopes and Instrumentation 2014: Ultraviolet to Gamma Ra

    InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

    Full text link
    We study the problem of completing various visual document understanding (VDU) tasks, e.g., question answering and information extraction, on real-world documents through human-written instructions. To this end, we propose InstructDoc, the first large-scale collection of 30 publicly available VDU datasets, each with diverse instructions in a unified format, which covers a wide range of 12 tasks and includes open document types/formats. Furthermore, to enhance the generalization performance on VDU tasks, we design a new instruction-based document reading and understanding model, InstructDr, that connects document images, image encoders, and large language models (LLMs) through a trainable bridging module. Experiments demonstrate that InstructDr can effectively adapt to new VDU datasets, tasks, and domains via given instructions and outperforms existing multimodal LLMs and ChatGPT without specific training.Comment: Accepted by AAAI2024; project page: https://github.com/nttmdlab-nlp/InstructDo

    Cavernous hemangioma in the mediastinum: A case report focusing on multimodal image findings.

    Get PDF
    A 50-year-old female was admitted to Jichi Medical University Saitama Medical Center with a mediastinal tumor suspected on thoracic computed tomography (CT). The CT showed a well-demarcated, nodular lesion in the left anterior mediastinum, which was adjacent to the aortic arch. A contrast-enhanced dynamic CT study revealed spotty enhancement in the peripheral part of the tumor in the arterial phase; subsequently, the contrast effect progressively spread throughout the tumor in the delayed phase. On magnetic resonance imaging (MRI), the tumor had a homogeneously hyperintense signal on T2-weighted image. Diffusion-weighted image demonstrated high intensity in the tumor with a high apparent diffusion coefficient. A neurogenic tumor was suspected, and surgery was performed without a definitive preoperative diagnosis. The tumor was removed and was histologically diagnosed as a cavernous hemangioma. Here, we report the multimodality imaging findings of a patient with a cavernous hemangioma in the mediastinum. When a cavernous hemangioma is on the differential diagnosis of a mediastinal tumor, dynamic CT or MRI studies should be considered to facilitate making a preoperative diagnosis.departmental bulletin pape

    Amyloidomas on the bilateral buttocks associated with osteoarthropathy of the hips:a case report focusing on CT and MRI findings

    Get PDF
    A tumoral presentation of amyloidosis is uncommon in soft tissue. We here report a case of bulky amyloidomas on bilateral buttocks in a patient with osteoarthropathy of the hips. The 66-year-old patient’s history included 26 years of hemodialysis along with osteoarthropathy of the hips and a pathological fracture of the femoral neck. CT showed ill-defined bulky masses with a slightly higher density compared with that of muscles in the subcutaneous regions of the bilateral buttocks. Moderate contrast enhancement of the lesions was revealed. The masses contained diffuse tiny fat depositions and psammomatous calcifications, suggestive of amyloid deposition. On MRI, the masses demonstrated markedly hypointense signals on T2-weighted image, consistent with amyloidosis. Characteristic scattered spots of high intensity, suggesting fat or microcalcifications, were noted within the masses on T1-weighted image. Mass-like synovial thickening was observed in the bilateral hip joints with discrete marginal erosions of the femoral heads. The CT and MRI features of the hips were compatible with typical amyloid osteoarthropathy.departmental bulletin pape

    Sigle Agent of Posttransplant Cyclophosphamide Without Calcineurin Inhibitor Controls Severity of Experimental Chronic GVHD

    Get PDF
    Chronic graft-versus-host disease (GVHD) is a major cause of late death and morbidity following allogeneic hematopoietic cell transplantation (HCT), but its pathogenesis remains unclear. Recently, haplo-identical HCT with post-transplant cyclophosphamide (Haplo-HCT with PTCY) was found to achieve a low incidence rate of acute GVHD and chronic GVHD. However, while the pathogenesis of acute GVHD following Haplo-HCT with PTCY has been well investigated, that of chronic GVHD remains to be elucidated, especially in HLA-matched HCT with PTCY. Based on its safety profile, PTCY is currently applied for the human leucocyte antigen (HLA)-matched HCT setting. Here, we investigated the mechanisms of chronic GVHD following HLA-matched HCT with PTCY using a well-defined mouse chronic GVHD model. PTCY attenuated clinical and pathological chronic GVHD by suppressing effector T-cells and preserving regulatory T-cells compared with a control group. Additionally, we demonstrated that cyclosporine A (CsA) did not show any additional positive effects on attenuation of GVHD in PTCY-treated recipients. These results suggest that monotherapy with PTCY without CsA could be a promising strategy for the prevention of chronic GVHD following HLA-matched HCT

    Computational Fluid Dynamics in Unruptured Intracranial Aneurysms

    Get PDF
    Introduction and Objective: Intracranial aneurysm, also known as brain aneurysm, is a cerebrovascular disorder in which weakness in the wall of a cerebral artery causes a localized dilation or ballooning of the blood vessel. There is no objective way, device or tools, of predicting rupture of aneurysm so far. Computational fluid dynamics (CFDs) was proposed as a tool to identify the rupture risk. Purpose of study: To reveal the correlation of CFD findings with intraoperative microscopic findings and prove the relevance of CFDin the prediction of rupture risk and in the management of unruptured intracranial aneurysms. Subjects and Methods: A prospective cohort study was conducted inNeurosurgery department of Fujita Health University Banbuntane Hotokukai Hospital, Nagoya, Japanduring a 3?month period in 2018,from January to March, Ten patientswere diagnosed unruptured intracranial aneurysms (UIA). In diagnosis computed tomography (CT) angiogram, CFD and digital subtraction angiogram were included. Intraoperatively microscopic examination of the aneurysm wall was carried out and images recorded. The correlation between microscopic dome morphology and CFD information was performed. Results: Nine cases were found intraoperatively to have a higher risk of rupture based on the thinning of the wall. One cases had an atherosclerotic wall. All cases had low wall shear stress (WSS). In 90 % of cases Low WSS was able to predict the potency rupture risk in the near future. Conclusions: This study of CFD and its correlation with intraoperativefindings of the aneurysm suggested that low WSS of the aneurysm wall is associated with thin wall aneurysm and hence increased risk of aneurysm rupture. Thus CFD can be used to predict the risk of rupture of unruptured aneurysm and for planning of its treatment
    corecore