309 research outputs found
Towards using web-crawled data for domain adaptation in statistical machine translation
This paper reports on the ongoing work focused on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused web crawling. We present a strategy for crawling monolingual and parallel data and their exploitation for testing, language modelling, and system tuning in a phrase--based machine translation framework. The proposed approach is evaluated on the domains of Natural Environment and Labour Legislation and two language
pairs: English–French and English–Greek
Spatially dispersive finite-difference time-domain analysis of sub-wavelength imaging by the wire medium slabs
In this paper, a spatially dispersive finite-difference time-domain (FDTD)
method to model wire media is developed and validated. Sub-wavelength imaging
properties of the finite wire medium slabs are examined. It is demonstrated
that the slab with its thickness equal to an integer number of half-wavelengths
is capable of transporting images with sub-wavelength resolution from one
interface of the slab to another. It is also shown that the operation of such
transmission devices is not sensitive to their transverse dimensions, which can
be made even comparable to the wavelength. In this case, the edge diffractions
are negligible and do not disturb the image formation.Comment: 14 pages, 13 figures, submitted to Optics Expres
Numerical and experimental time-domain characterization of terahertz conducting polymers
A comprehensive framework for the theoretical and experimental investigation of thin conducting films for terahertz applications is presented. The electromagnetic properties of conducting polymers spin-coated on low-loss dielectric substrates are characterized by means of terahertz time-domain spectroscopy and interpreted through the Drude-Smith model. The analysis is complemented by an advanced finite-difference time-domain algorithm, which rigorously deals both with the dispersive nature of the involved materials and the extremely subwavelength thickness of the conducting films. Significant agreement is observed among experimental measurements, numerical simulations, and theoretical results. The proposed approach provides a complete toolbox for the engineering of terahertz optoelectronic devices
D3.1. Architecture and design of the platform
This document aims to establish the requirements and the technological basis and design of the PANACEA platform. These are the main goals of the document: - Survey the different technological approaches that can be used in PANACEA. - Specify some guidelines for the metadata. - Establish the requirements for the platform. - Make a Common Interface proposal for the tools. - Propose a format for the data to be exchanged by the tools (Travelling Object). - Choose the technologies that will be used to develop the platform. - Propose a workplan
Third version (v4) of the integrated platform and documentation
The deliverable describes the third and final version of the PANACEA platform
Adquisición automática de recursos para traducción automática en el proyecto Abu-MaTran
This paper provides an overview of the research and development activities carried out to alleviate the language resources' bottleneck in machine translation within the Abu-MaTran project. We have developed a range of tools for the acquisition of the main resources required by the two most popular approaches to machine translation, i.e. statistical (corpora) and rule-based models (dictionaries and rules). All these tools have been released under open-source licenses and have been developed with the aim of being useful for industrial exploitation.Este artículo presenta una panorámica de las actividades de investigación y desarrollo destinadas a aliviar el cuello de botella que supone la falta de recursos lingüísticos en el campo de la traducción automática que se han llevado a cabo en el ámbito del proyecto Abu-MaTran. Hemos desarrollado un conjunto de herramientas para la adquisición de los principales recursos requeridos por las dos aproximaciones m as comunes a la traducción automática, modelos estadísticos (corpus) y basados en reglas (diccionarios y reglas). Todas estas herramientas han sido publicadas con licencias libres y han sido desarrolladas con el objetivo de ser útiles para ser explotadas en el ámbito comercial.The research leading to these results has received funding from the European Union Seventh Framework Programme FP7/2007-2013 under grant agreement PIAP-GA-2012-324414 (Abu-MaTran)
D6.1: Technologies and Tools for Lexical Acquisition
This report describes the technologies and tools to be used for Lexical Acquisition in PANACEA. It includes descriptions of existing technologies and tools which can be built on and improved within PANACEA, as well as of new technologies and tools to be developed and integrated in PANACEA platform. The report also specifies the Lexical Resources to be produced. Four main areas of lexical acquisition are included: Subcategorization frames (SCFs), Selectional Preferences (SPs), Lexical-semantic Classes (LCs), for both nouns and verbs, and Multi-Word Expressions (MWEs)
Business ethic and industrial marketing-fair trade and ethical business behavior as pylons of a success marketing plan in industry
- …
