309 research outputs found

    Towards using web-crawled data for domain adaptation in statistical machine translation

    This paper reports on ongoing work on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused web crawling. We present a strategy for crawling monolingual and parallel data and for exploiting them for testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on the domains of Natural Environment and Labour Legislation and on two language pairs: English–French and English–Greek.

    Spatially dispersive finite-difference time-domain analysis of sub-wavelength imaging by the wire medium slabs

    In this paper, a spatially dispersive finite-difference time-domain (FDTD) method to model wire media is developed and validated. Sub-wavelength imaging properties of finite wire medium slabs are examined. It is demonstrated that a slab with thickness equal to an integer number of half-wavelengths is capable of transporting images with sub-wavelength resolution from one interface of the slab to the other. It is also shown that the operation of such transmission devices is not sensitive to their transverse dimensions, which can even be made comparable to the wavelength. In this case, the edge diffractions are negligible and do not disturb the image formation. Comment: 14 pages, 13 figures, submitted to Optics Express.

    Numerical and experimental time-domain characterization of terahertz conducting polymers

    A comprehensive framework for the theoretical and experimental investigation of thin conducting films for terahertz applications is presented. The electromagnetic properties of conducting polymers spin-coated on low-loss dielectric substrates are characterized by means of terahertz time-domain spectroscopy and interpreted through the Drude-Smith model. The analysis is complemented by an advanced finite-difference time-domain algorithm, which rigorously handles both the dispersive nature of the materials involved and the extremely subwavelength thickness of the conducting films. Significant agreement is observed among experimental measurements, numerical simulations, and theoretical results. The proposed approach provides a complete toolbox for the engineering of terahertz optoelectronic devices.

    D3.1. Architecture and design of the platform

    This document aims to establish the requirements, the technological basis, and the design of the PANACEA platform. These are the main goals of the document:
    - Survey the different technological approaches that can be used in PANACEA.
    - Specify some guidelines for the metadata.
    - Establish the requirements for the platform.
    - Make a Common Interface proposal for the tools.
    - Propose a format for the data to be exchanged by the tools (Travelling Object).
    - Choose the technologies that will be used to develop the platform.
    - Propose a workplan.

    Third version (v4) of the integrated platform and documentation

    The deliverable describes the third and final version of the PANACEA platform.

    Adquisición automática de recursos para traducción automática en el proyecto Abu-MaTran (Automatic acquisition of resources for machine translation in the Abu-MaTran project)

    This paper provides an overview of the research and development activities carried out to alleviate the language-resource bottleneck in machine translation within the Abu-MaTran project. We have developed a range of tools for the acquisition of the main resources required by the two most popular approaches to machine translation, i.e. statistical models (corpora) and rule-based models (dictionaries and rules). All these tools have been released under open-source licenses and have been developed with the aim of being useful for industrial exploitation. The research leading to these results has received funding from the European Union Seventh Framework Programme FP7/2007-2013 under grant agreement PIAP-GA-2012-324414 (Abu-MaTran).

    D6.1: Technologies and Tools for Lexical Acquisition

    This report describes the technologies and tools to be used for Lexical Acquisition in PANACEA. It includes descriptions of existing technologies and tools that can be built on and improved within PANACEA, as well as of new technologies and tools to be developed and integrated in the PANACEA platform. The report also specifies the Lexical Resources to be produced. Four main areas of lexical acquisition are included: Subcategorization frames (SCFs), Selectional Preferences (SPs), Lexical-semantic Classes (LCs), for both nouns and verbs, and Multi-Word Expressions (MWEs).