Search CORE

2,979 research outputs found

Startups and Stanford University

Author: Lebret Hervé
Publication venue
Publication date: 01/01/2017
Field of study

Startups have become in less than 50 years a major component of innovation and economic growth. Silicon Valley has been the place where the startup phenomenon was the most obvious and Stanford University was a major component of that success. Companies such as Google, Yahoo, Sun Microsystems, Cisco, Hewlett Packard had very strong links with Stanford but even these vary famous success stories cannot fully describe the richness and diversity of the Stanford entrepreneurial activity. This report explores the dynamics of more than 5000 companies founded by Stanford University alumni and staff, through their value creation, their field of activities, their growth patterns and more. The report also explores some features of the founders of these companies such as their academic background or the number of years between their Stanford experience and their company creation

arXiv.org e-Print Archive

Crossref

280 Birds with One Stone: Inducing Multilingual Taxonomies from Wikipedia using Character-level Classification

Author: Aberer Karl
Gupta Amit
Harkous Hamza
Lebret Rémi
Publication venue
Publication date: 05/05/2017
Field of study

We propose a simple, yet effective, approach towards inducing multilingual taxonomies from Wikipedia. Given an English taxonomy, our approach leverages the interlanguage links of Wikipedia followed by character-level classifiers to induce high-precision, high-coverage taxonomies in other languages. Through experiments, we demonstrate that our approach significantly outperforms the state-of-the-art, heuristics-heavy approaches for six languages. As a consequence of our work, we release presumably the largest and the most accurate multilingual taxonomic resource spanning over 280 languages

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Association for the Advancement of Artificial Intelligence: AAAI Publications

Phrase-based Image Captioning

Author: Collobert Ronan
Lebret Rémi
Pinheiro Pedro O.
Publication venue
Publication date: 09/04/2015
Field of study

Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results in two popular datasets for the task: Flickr30k and the recently proposed Microsoft COCO

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Polisis: Automated Analysis and Presentation of Privacy Policies Using Deep Learning

Author: Aberer Karl
Fawaz Kassem
Harkous Hamza
Lebret Rémi
Schaub Florian
Shin Kang G.
Publication venue
Publication date: 29/06/2018
Field of study

Privacy policies are the primary channel through which companies inform users about their data collection and sharing practices. These policies are often long and difficult to comprehend. Short notices based on information extracted from privacy policies have been shown to be useful but face a significant scalability hurdle, given the number of policies and their evolution over time. Companies, users, researchers, and regulators still lack usable and scalable tools to cope with the breadth and depth of privacy policies. To address these hurdles, we propose an automated framework for privacy policy analysis (Polisis). It enables scalable, dynamic, and multi-dimensional queries on natural language privacy policies. At the core of Polisis is a privacy-centric language model, built with 130K privacy policies, and a novel hierarchy of neural-network classifiers that accounts for both high-level aspects and fine-grained details of privacy practices. We demonstrate Polisis' modularity and utility with two applications supporting structured and free-form querying. The structured querying application is the automated assignment of privacy icons from privacy policies. With Polisis, we can achieve an accuracy of 88.4% on this task. The second application, PriBot, is the first freeform question-answering system for privacy policies. We show that PriBot can produce a correct answer among its top-3 results for 82% of the test questions. Using an MTurk user study with 700 participants, we show that at least one of PriBot's top-3 answers is relevant to users for 89% of the test questions.Comment: Published at USENIX Security 2018; associated website: https://pribot.or

Infoscience - École polytechnique fédérale de Lausanne

arXiv.org e-Print Archive

Generating Video Descriptions with Topic Guidance

Author: Ba Lei Jimmy
Hinton Geoffrey
Lebret Rémi
Lin Chin-Yew
Venugopalan Subhashini
Publication venue
Publication date: 04/09/2017
Field of study

Generating video descriptions in natural language (a.k.a. video captioning) is a more challenging task than image captioning as the videos are intrinsically more complicated than images in two aspects. First, videos cover a broader range of topics, such as news, music, sports and so on. Second, multiple topics could coexist in the same video. In this paper, we propose a novel caption model, topic-guided model (TGM), to generate topic-oriented descriptions for videos in the wild via exploiting topic information. In addition to predefined topics, i.e., category tags crawled from the web, we also mine topics in a data-driven way based on training captions by an unsupervised topic mining model. We show that data-driven topics reflect a better topic schema than the predefined topics. As for testing video topic prediction, we treat the topic mining model as teacher to train the student, the topic prediction model, by utilizing the full multi-modalities in the video especially the speech modality. We propose a series of caption models to exploit topic guidance, including implicitly using the topics as input features to generate words related to the topic and explicitly modifying the weights in the decoder with topics to function as an ensemble of topic-aware language decoders. Our comprehensive experimental results on the current largest video caption dataset MSR-VTT prove the effectiveness of our topic-guided model, which significantly surpasses the winning performance in the 2016 MSR video to language challenge.Comment: Appeared at ICMR 201

arXiv.org e-Print Archive

Crossref

Rôle des médias audiovisuels, de la téléphonie mobile, de la publicité et des documents imprimés dans la gestion des PME agroalimentaires

Author: Broutin Cecile
Lebret Marie-Christine
Publication venue: Technical Centre for Agricultural and Rural Cooperation
Publication date: 31/03/2015
Field of study

Cette communication est organisée sous deux angles : 1. l’utilisation des médias audiovisuels, de la téléphonie mobile, de la publicité et des documents imprimés pour véhiculer de l’information vers les entreprises et les aider à améliorer leurs performances, 2. leur utilisation par les entreprises pour promouvoir leurs produits et améliorer leur efficacité commerciale.Cette communication met l'accent sur l’utilisation des médias et autres supports de communication pour véhiculer de l’information vers les entreprises et les aider à améliorer leurs performances et leur efficacité commerciale..

CGSpace (CGIAR)

A GAIN SCHEDULED CONTROL LAW FOR FIN/RUDDER ROLL STABILISATION OF SHIPS

Author: Lebret Guy
Tanguy Hervé
Publication venue: HAL CCSD
Publication date: 07/07/2004
Field of study

International audienceTaking into account the variations of the environment is a means of improving performances of roll stabilisation systems. The ship behaviour is modelled as a MIMO LPV system. A methodology is presented which leads to a gain-scheduled control law. The synthesis is based on multi objective optimisation, and on the representation of the standard system as a polytopical system, which depends on ship speed and on a stabilisation quality factor. Simulation results are given