2,967 research outputs found

    Examining Forgetting in Continual Pre-training of Aligned Large Language Models

    Full text link
    Recent advances in Large Language Models (LLMs) have exhibited remarkable proficiency across various tasks. Given the potent applications of LLMs in numerous fields, there has been a surge in LLM development. In developing LLMs, a common practice involves continual pre-training on previously fine-tuned models. However, this can lead to catastrophic forgetting. In our work, we investigate the phenomenon of forgetting that occurs during continual pre-training on an existing fine-tuned LLM. We evaluate the impact of continuous pre-training on the fine-tuned LLM across various dimensions, including output format, knowledge, and reliability. Experiment results highlight the non-trivial challenge of addressing catastrophic forgetting during continual pre-training, especially the repetition issue.Comment: Work in progres

    In situ observation of self-patterned defect formation in atomic superfluids -- from ring dark solitons to vortex dipole necklaces

    Get PDF
    Unveiling non-equilibrium dynamics of solitonic and topological defect structures in a multidimensional nonlinear medium is a current frontier across diverse fields. One of the quintessential objects is a ring dark soliton (RDS), whose dynamics are expected to display remarkable interplay between symmetry and self-patterned topological defect formation from a transverse (snake) instability but has thus far evaded full experimental observations. Here, we report an experimental realization of RDS generation in a two-dimensional atomic superfluid trapped in a circular box. By quenching the confining box potential, we observe spontaneous soliton emission from the edge and its peculiar signature in radial motion. As an RDS evolves, we observe spontaneous transverse modulations at discrete azimuthal angles, which clearly result in a patterned formation of a circular vortex dipole array. Through collisions of the vortex dipoles with the box trap, we observe vortex unbinding, vortex pinning to the edge, and emission of rarefaction pulses. Our box-quench protocol opens a new way to study multidimensional dark solitons, structured formation of topological defects, and potentially the dynamics of ordered quantum vortex matter
    corecore