1,082 research outputs found

    Second generation Dirac cones and inversion symmetry breaking induced gaps in graphene/hexagonal boron nitride

    Get PDF
    Graphene/h-BN has emerged as a model van der Waals heterostructure, and the band structure engineering by the superlattice potential has led to various novel quantum phenomena including the self-similar Hofstadter butterfly states. Although newly generated second generation Dirac cones (SDCs) are believed to be crucial for understanding such intriguing phenomena, so far fundamental knowledge of SDCs in such heterostructure, e.g. locations and dispersion of SDCs, the effect of inversion symmetry breaking on the gap opening, still remains highly debated due to the lack of direct experimental results. Here we report first direct experimental results on the dispersion of SDCs in 0^\circ aligned graphene/h-BN heterostructure using angle-resolved photoemission spectroscopy. Our data reveal unambiguously SDCs at the corners of the superlattice Brillouin zone, and at only one of the two superlattice valleys. Moreover, gaps of \approx 100 meV and \approx 160 meV are observed at the SDCs and the original graphene Dirac cone respectively. Our work highlights the important role of a strong inversion symmetry breaking perturbation potential in the physics of graphene/h-BN, and fills critical knowledge gaps in the band structure engineering of Dirac fermions by a superlattice potential.Comment: Nature Physics 2016, In press, Supplementary Information include

    Thermodynamic entropy as an indicator for urban sustainability?

    Get PDF
    As foci of economic activity, resource consumption, and the production of material waste and pollution, cities represent both a major hurdle and yet also a source of great potential for achieving the goal of sustainability. Motivated by the desire to better understand and measure sustainability in quantitative terms we explore the applicability of thermodynamic entropy to urban systems as a tool for evaluating sustainability. Having comprehensively reviewed the application of thermodynamic entropy to urban systems we argue that the role it can hope to play in characterising sustainability is limited. We show that thermodynamic entropy may be considered as a measure of energy efficiency, but must be complimented by other indices to form part of a broader measure of urban sustainability

    Towards Trustworthy Dataset Distillation

    Full text link
    Efficiency and trustworthiness are two eternal pursuits when applying deep learning in real-world applications. With regard to efficiency, dataset distillation (DD) endeavors to reduce training costs by distilling the large dataset into a tiny synthetic dataset. However, existing methods merely concentrate on in-distribution (InD) classification in a closed-world setting, disregarding out-of-distribution (OOD) samples. On the other hand, OOD detection aims to enhance models' trustworthiness, which is always inefficiently achieved in full-data settings. For the first time, we simultaneously consider both issues and propose a novel paradigm called Trustworthy Dataset Distillation (TrustDD). By distilling both InD samples and outliers, the condensed datasets are capable to train models competent in both InD classification and OOD detection. To alleviate the requirement of real outlier data and make OOD detection more practical, we further propose to corrupt InD samples to generate pseudo-outliers and introduce Pseudo-Outlier Exposure (POE). Comprehensive experiments on various settings demonstrate the effectiveness of TrustDD, and the proposed POE surpasses state-of-the-art method Outlier Exposure (OE). Compared with the preceding DD, TrustDD is more trustworthy and applicable to real open-world scenarios. Our code will be publicly available.Comment: 20 pages, 20 figure

    MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution

    Full text link
    Although Vision Transformers (ViTs) have recently advanced computer vision tasks significantly, an important real-world problem was overlooked: adapting to variable input resolutions. Typically, images are resized to a fixed resolution, such as 224x224, for efficiency during training and inference. However, uniform input size conflicts with real-world scenarios where images naturally vary in resolution. Modifying the preset resolution of a model may severely degrade the performance. In this work, we propose to enhance the model adaptability to resolution variation by optimizing the patch embedding. The proposed method, called Multi-Scale Patch Embedding (MSPE), substitutes the standard patch embedding with multiple variable-sized patch kernels and selects the best parameters for different resolutions, eliminating the need to resize the original image. Our method does not require high-cost training or modifications to other parts, making it easy to apply to most ViT models. Experiments in image classification, segmentation, and detection tasks demonstrate the effectiveness of MSPE, yielding superior performance on low-resolution inputs and performing comparably on high-resolution inputs with existing methods

    A Novel Clustering Tree-based Video lookup Strategy for Supporting VCR-like Operations in MANETs

    Get PDF
    Mobile Peer-to-Peer (MP2P) network is a promising avenue for large-scale deployment of Video-on-Demand (VoD) applications over mobile ad-hoc networks (MANETs). In P2P VoD systems, fast search for resources is key determinants for improving the Quality of Service (QoS) due to the low delay of seeking resources caused by streaming interactivity. In this paper, we propose a novel Clustering Tree-based Video Lookup strategy for supporting VCR-like operations in MANETs (CTVL) CTVL selects the chunks with the high popularity as "overlay router" chunks to build the "virtual connection" with other chunks in terms of the popularities and external connection of video chunks. CTVL designs a new clustering strategy to group nodes in P2P networks and a maintenance mechanism of cluster structure, which achieves the high system scalability and fast resource search performance. Thorough simulation results also show how CTVL achieves higher average lookup success rate, lower maintenance cost, lower average end-to-end delay and lower packet loss ratio (PLR) in comparison with other state of the art solutions

    WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

    Full text link
    Segmenting and recognizing diverse object parts is crucial in computer vision and robotics. Despite significant progress in object segmentation, part-level segmentation remains underexplored due to complex boundaries and scarce annotated data. To address this, we propose a novel Weakly-supervised Part Segmentation (WPS) setting and an approach called WPS-SAM, built on the large-scale pre-trained vision foundation model, Segment Anything Model (SAM). WPS-SAM is an end-to-end framework designed to extract prompt tokens directly from images and perform pixel-level segmentation of part regions. During its training phase, it only uses weakly supervised labels in the form of bounding boxes or points. Extensive experiments demonstrate that, through exploiting the rich knowledge embedded in pre-trained foundation models, WPS-SAM outperforms other segmentation models trained with pixel-level strong annotations. Specifically, WPS-SAM achieves 68.93% mIOU and 79.53% mACC on the PartImageNet dataset, surpassing state-of-the-art fully supervised methods by approximately 4% in terms of mIOU

    Active Generalized Category Discovery

    Full text link
    Generalized Category Discovery (GCD) is a pragmatic and challenging open-world task, which endeavors to cluster unlabeled samples from both novel and old classes, leveraging some labeled data of old classes. Given that knowledge learned from old classes is not fully transferable to new classes, and that novel categories are fully unlabeled, GCD inherently faces intractable problems, including imbalanced classification performance and inconsistent confidence between old and new classes, especially in the low-labeling regime. Hence, some annotations of new classes are deemed necessary. However, labeling new classes is extremely costly. To address this issue, we take the spirit of active learning and propose a new setting called Active Generalized Category Discovery (AGCD). The goal is to improve the performance of GCD by actively selecting a limited amount of valuable samples for labeling from the oracle. To solve this problem, we devise an adaptive sampling strategy, which jointly considers novelty, informativeness and diversity to adaptively select novel samples with proper uncertainty. However, owing to the varied orderings of label indices caused by the clustering of novel classes, the queried labels are not directly applicable to subsequent training. To overcome this issue, we further propose a stable label mapping algorithm that transforms ground truth labels to the label space of the classifier, thereby ensuring consistent training across different active selection stages. Our method achieves state-of-the-art performance on both generic and fine-grained datasets. Our code is available at https://github.com/mashijie1028/ActiveGCDComment: Accepted to CVPR 202

    POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation

    Full text link
    This paper targets to enhance the diffusion-based text-to-video generation by improving the two input prompts, including the noise and the text. Accommodated with this goal, we propose POS, a training-free Prompt Optimization Suite to boost text-to-video models. POS is motivated by two observations: (1) Video generation shows instability in terms of noise. Given the same text, different noises lead to videos that differ significantly in terms of both frame quality and temporal consistency. This observation implies that there exists an optimal noise matched to each textual input; To capture the potential noise, we propose an optimal noise approximator to approach the potential optimal noise. Particularly, the optimal noise approximator initially searches a video that closely relates to the text prompt and then inverts it into the noise space to serve as an improved noise prompt for the textual input. (2) Improving the text prompt via LLMs often causes semantic deviation. Many existing text-to-vision works have utilized LLMs to improve the text prompts for generation enhancement. However, existing methods often neglect the semantic alignment between the original text and the rewritten one. In response to this issue, we design a semantic-preserving rewriter to impose contraints in both rewritng and denoising phrases to preserve the semantic consistency. Extensive experiments on popular benchmarks show that our POS can improve the text-to-video models with a clear margin. The code will be open-sourced

    Multi-objective Dwarf Mongoose Optimization Algorithm with Leader Guidance and Dominated Solution Evolution Mechanism

    Get PDF
    In the face of the increasingly complex multi-objective optimization problems, it is necessary to develop novel multi-objective optimization algorithms to meet the challenges. This paper proposes a multi-objective dwarf mongoose optimization algorithm (MODMO) with leader guidance and dominated solution dynamic reduction evolution mechanism. In the leader guidance mechanism, a dynamic trade-off factor is introduced to regulate the search radius of the scout mongoose exploring the mound. At the same time, an external archive is constructed with a non-inferior solution set and the leader is determined according to the non-dominated ranking level, and then the scout mongoose is guided to advance to the multi-objective frontier to improve the convergence of the algorithm. The dominant solution dynamic reduction evolution strategy is constructed to overcome the redundancy problem in the process of maintaining the external archive of non-inferior solutions. It dynamically selects the dominant solutions based on the dominance relationship and crowding distance and stores them in the external archive. The dominant solution information is integrated into the population evolution to realize the mining of multi-objective potential frontier and enhance the diversity of the algorithm. Compared with five representative algorithms on ZDT, DTLZ and WFG benchmark functions, experimental results show that MODMO algorithm has significant advantages in convergence and diversity
    corecore