202 research outputs found

    Promoting cold-start items in recommender systems

    Full text link
    As one of major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, so-called the item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. To our surprise, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.Comment: 6 pages, 6 figure

    Emergence of scaling in human-interest dynamics

    Get PDF
    Human behaviors are often driven by human interests. Despite intense recent efforts in exploring the dynamics of human behaviors, little is known about human-interest dynamics, partly due to the extreme difficulty in accessing the human mind from observations. However, the availability of large-scale data, such as those from e-commerce and smart-phone communications, makes it possible to probe into and quantify the dynamics of human interest. Using three prototypical "big data" sets, we investigate the scaling behaviors associated with human-interest dynamics. In particular, from the data sets we uncover power-law scaling associated with the three basic quantities: (1) the length of continuous interest, (2) the return time of visiting certain interest, and (3) interest ranking and transition. We argue that there are three basic ingredients underlying human-interest dynamics: preferential return to previously visited interests, inertial effect, and exploration of new interests. We develop a biased random-walk model, incorporating the three ingredients, to account for the observed power-law scaling relations. Our study represents the first attempt to understand the dynamical processes underlying human interest, which has significant applications in science and engineering, commerce, as well as defense, in terms of specific tasks such as recommendation and human-behavior prediction

    Anchoring Bias in Online Voting

    Full text link
    Voting online with explicit ratings could largely reflect people's preferences and objects' qualities, but ratings are always irrational, because they may be affected by many unpredictable factors like mood, weather, as well as other people's votes. By analyzing two real systems, this paper reveals a systematic bias embedding in the individual decision-making processes, namely people tend to give a low rating after a low rating, as well as a high rating following a high rating. This so-called \emph{anchoring bias} is validated via extensive comparisons with null models, and numerically speaking, the extent of bias decays with interval voting number in a logarithmic form. Our findings could be applied in the design of recommender systems and considered as important complementary materials to previous knowledge about anchoring effects on financial trades, performance judgements, auctions, and so on.Comment: 5 pages, 4 tables, 5 figure

    Epidemic Spreading in Weighted Networks: An Edge-Based Mean-Field Solution

    Full text link
    Weight distribution largely impacts the epidemic spreading taking place on top of networks. This paper studies a susceptible-infected-susceptible model on regular random networks with different kinds of weight distributions. Simulation results show that the more homogeneous weight distribution leads to higher epidemic prevalence, which, unfortunately, could not be captured by the traditional mean-field approximation. This paper gives an edge-based mean-field solution for general weight distribution, which can quantitatively reproduce the simulation results. This method could find its applications in characterizing the non-equilibrium steady states of dynamical processes on weighted networks.Comment: 7 pages, 5 figure

    Predicting link directions via a recursive subgraph-based ranking

    Full text link
    Link directions are essential to the functionality of networks and their prediction is helpful towards a better knowledge of directed networks from incomplete real-world data. We study the problem of predicting the directions of some links by using the existence and directions of the rest of links. We propose a solution by first ranking nodes in a specific order and then predicting each link as stemming from a lower-ranked node towards a higher-ranked one. The proposed ranking method works recursively by utilizing local indicators on multiple scales, each corresponding to a subgraph extracted from the original network. Experiments on real networks show that the directions of a substantial fraction of links can be correctly recovered by our method, which outperforms either purely local or global methods.Comment: 6 pages, 5 figures; revised arguments for methods section; figures replotted; minor revision

    Relative clock demonstrates the endogenous heterogeneity of human dynamics

    Full text link
    The heavy-tailed inter-event time distributions are widely observed in many human-activated systems, which may result from both endogenous mechanisms like the highest-priority-first protocol and exogenous factors like the varying global activity versus time. To distinguish the effects on temporal statistics from different mechanisms is this of theoretical significance. In this Letter, we propose a new timing method by using a relative clock, where the time length between two consecutive events of an individual is counted as the number of other individuals' events appeared during this interval. We propose a model, in which agents act either in a constant rate or with a power-law inter-event time distribution, and the global activity either keeps unchanged or varies periodically versus time. Our analysis shows that the heavy tails caused by the heterogeneity of global activity can be eliminated by setting the relative clock, yet the heterogeneity due to real individual behaviors still exists. We perform extensive experiments on four large-scale systems, the search engine by AOL, a social bookmarking system--Delicious, a short-message communication network, and a microblogging system--Twitter. Strong heterogeneity and clear seasonality of global activity are observed, but the heavy tails cannot be eliminated by using the relative clock. Our results suggest the existence of endogenous heterogeneity of human dynamics.Comment: 6 pages 7 figures 2 Table
    corecore