ResearchHub | Open Science Community

ArnetMiner

Jie Tang et al.Aug 24, 2008

This paper addresses several key issues in the ArnetMiner system, which aims at extracting and mining academic social networks. Specifically, the system focuses on: 1) Extracting researcher profiles automatically from the Web; 2) Integrating the publication data into the network from existing digital libraries; 3) Modeling the entire academic network; and 4) Providing search services for the academic network. So far, 448,470 researcher profiles have been extracted using a unified tagging approach. We integrate publications from online Web databases and propose a probabilistic framework to deal with the name ambiguity problem. Furthermore, we propose a unified modeling approach to simultaneously model topical aspects of papers, authors, and publication venues. Search services such as expertise search and people association search have been provided based on the modeling results. In this paper, we describe the architecture and main features of the system. We also present the empirical evaluation of the proposed methods.

Artificial Intelligence

Information Systems

0

Paper

Artificial Intelligence

1,975

0

Save

0

Scalable Influence Maximization in Social Networks under the Linear Threshold Model

Wei Chen et al.Dec 1, 2010

Influence maximization is the problem of finding a small set of most influential nodes in a social network so that their aggregated influence in the network is maximized. In this paper, we study influence maximization in the linear threshold model, one of the important models formalizing the behavior of influence propagation in social networks. We first show that computing exact influence in general networks in the linear threshold model is #P-hard, which closes an open problem left in the seminal work on influence maximization by Kempe, Kleinberg, and Tardos, 2003. As a contrast, we show that computing influence in directed a cyclic graphs (DAGs) can be done in time linear to the size of the graphs. Based on the fast computation in DAGs, we propose the first scalable influence maximization algorithm tailored for the linear threshold model. We conduct extensive simulations to show that our algorithm is scalable to networks with millions of nodes and edges, is orders of magnitude faster than the greedy approximation algorithm proposed by Kempe et al. and its optimized versions, and performs consistently among the best algorithms while other heuristic algorithms not design specifically for the linear threshold model have unstable performances on different real-world networks.

Artificial Intelligence

Theoretical Computer Science

0

Paper

Artificial Intelligence

871

0

Save

0

Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks

Dewan Farid et al.Sep 11, 2013

In this paper, we introduce two independent hybrid mining algorithms to improve the classification accuracy rates of decision tree (DT) and naïve Bayes (NB) classifiers for the classification of multi-class problems. Both DT and NB classifiers are useful, efficient and commonly used for solving classification problems in data mining. Since the presence of noisy contradictory instances in the training set may cause the generated decision tree suffers from overfitting and its accuracy may decrease, in our first proposed hybrid DT algorithm, we employ a naïve Bayes (NB) classifier to remove the noisy troublesome instances from the training set before the DT induction. Moreover, it is extremely computationally expensive for a NB classifier to compute class conditional independence for a dataset with high dimensional attributes. Thus, in the second proposed hybrid NB classifier, we employ a DT induction to select a comparatively more important subset of attributes for the production of naïve assumption of class conditional independence. We tested the performances of the two proposed hybrid algorithms against those of the existing DT and NB classifiers respectively using the classification accuracy, precision, sensitivity-specificity analysis, and 10-fold cross validation on 10 real benchmark datasets from UCI (University of California, Irvine) machine learning repository. The experimental results indicate that the proposed methods have produced impressive results in the classification of real life challenging multi-class problems. They are also able to automatically extract the most valuable training datasets and identify the most effective attributes for the description of instances from noisy complex training databases with large dimensions of attributes.

Artificial Intelligence

Information Systems

0

Paper

Artificial Intelligence

328

0

Save

0

Neural Collaborative Filtering vs. Matrix Factorization Revisited

Steffen Rendle et al.Sep 19, 2020

Embedding based models have been the state of the art in collaborative filtering for over a decade. Traditionally, the dot product or higher order equivalents have been used to combine two or more embeddings, e.g., most notably in matrix factorization. In recent years, it was suggested to replace the dot product with a learned similarity e.g. using a multilayer perceptron (MLP). This approach is often referred to as neural collaborative filtering (NCF). In this work, we revisit the experiments of the NCF paper that popularized learned similarities using MLPs. First, we show that with a proper hyperparameter selection, a simple dot product substantially outperforms the proposed learned similarities. Second, while a MLP can in theory approximate any function, we show that it is non-trivial to learn a dot product with an MLP. Finally, we discuss practical issues that arise when applying MLP based similarities and show that MLPs are too costly to use for item recommendation in production environments while dot products allow to apply very efficient retrieval algorithms. We conclude that MLPs should be used with care as embedding combiner and that dot products might be a better default choice.

Artificial Intelligence

Information Systems

0

Paper

Artificial Intelligence

321

0

Save

0

Style Normalization and Restitution for Generalizable Person Re-Identification

Xin Jin et al.Jun 1, 2020

Existing fully-supervised person re-identification (ReID) methods usually suffer from poor generalization capability caused by domain gaps. The key to solving this problem lies in filtering out identity-irrelevant interference and learning domain-invariant person representations. In this paper, we aim to design a generalizable person ReID framework which trains a model on source domains yet is able to generalize/perform well on target domains. To achieve this goal, we propose a simple yet effective Style Normalization and Restitution (SNR) module. Specifically, we filter out style variations (e.g., illumination, color contrast) by Instance Normalization (IN). However, such a process inevitably removes discriminative information. We propose to distill identity-relevant feature from the removed information and restitute it to the network to ensure high discrimination. For better disentanglement, we enforce a dual causal loss constraint in SNR to encourage the separation of identity-relevant features and identity-irrelevant features. Extensive experiments demonstrate the strong generalization capability of our framework. Our models empowered by the SNR modules significantly outperform the state-of-the-art domain generalization approaches on multiple widely-used person ReID benchmarks, and also show superiority on unsupervised domain adaptation.

Artificial Intelligence

Law

0

Paper

Artificial Intelligence

314

0

Save

0

Learning similarity with cosine similarity ensemble

Peipei Xia et al.Feb 20, 2015

Artificial Intelligence

Computer Vision And Pattern Recognition

0

Paper

Artificial Intelligence

311

0

Save

0

Shadowed set approximations of L-fuzzy sets

Li Zhang et al.Jun 1, 2024

Artificial Intelligence

Applied Mathematics

0

Paper

Artificial Intelligence

2

0

Save

0

CAMixerSR: Only Details Need More “Attention”

Yan Wang et al.Jun 16, 2024

Radiation

Computer Vision And Pattern Recognition

0

Paper

Save

MIDF-DMAP: Multimodal information dynamic fusion for drug molecule activity prediction

Wenlong Yi et al.Sep 1, 2024

Philosophy

Artificial Intelligence

0

Paper

Philosophy

Artificial Intelligence

0

Save

0

Research on the Construction of English Translation Teaching Mode based on Deep Learning Model and Artificial Intelligence

Li ZhangMar 1, 2024

As globalization continuously deepens, the importance of English translation in various fields has become increasingly prominent. Therefore, how to improve the level of English translation teaching and cultivate more excellent translation talents has become an important task for English translation teaching in the new era. The rise of artificial intelligence technologyprovides new ideas and methods for college English translation teaching. The application of artificial intelligence has endowed college English translation teachingwith the ability to change traditional teaching methods and improve students’ learning efficiency. Based on this, this paper will discuss how artificial intelligence can empower college English translation teaching and how to improve students’ English translation ability through artificial intelligence technology support, expecting to provide some reference value for college English translation teaching and promote the development of English teaching.

Artificial Intelligence

Biochemistry

0

Paper

Artificial Intelligence

Biochemistry

0

Save