Finding Community Topics and Membership in Graphs

Community detection in networks is a broad problem with many proposed solutions. Existing methods frequently make use of edge density and node attributes; however, the methods ultimately have different definitions of community and build strong assumptions

PDF / 509,954 Bytes
16 Pages / 439.37 x 666.142 pts Page_size
66 Downloads / 238 Views

DOWNLOAD

REPORT

Abstract. Community detection in networks is a broad problem with many proposed solutions. Existing methods frequently make use of edge density and node attributes; however, the methods ultimately have different deﬁnitions of community and build strong assumptions about community features into their models. We propose a new method for community detection, which estimates both per-community feature distributions (topics) and per-node community membership. Communities are modeled as connected subgraphs with nodes sharing similar attributes. Nodes may join multiple communities and share common attributes with each. Communities have an associated probability distribution over attributes and node attributes are modeled as draws from a mixture distribution. We make two basic assumptions about community structure: communities are densely connected and have a small network diameter. These assumptions inform the estimation of community topics and membership assignments without being too prescriptive. We present competitive results against state-of-the-art methods for ﬁnding communities in networks constructed from NSF awards, the DBLP repository, and the Scratch online community.

1

Introduction

Given a graph of self-organizing objects, we wish to estimate the latent topics around which the objects organize and discover community membership. We hypothesize groups with high edge density in graphs are evidence of communities whose members have similar attributes within a subset of the feature dimensions. In this paper we present Seeded Estimation of Network Communities (SENC). SENC is a probabilistic method which uses both node attributes and graph structure to simultaneously estimate community feature distributions and members. We assume a community may exist around seed groups in the network. Many community detection methods build strong assumptions regarding community features into their models, which limits generalizability. SENC provides a ﬂexible means of accounting for a variety of community structures through the use of conﬁgurable lower and upper bounds on discovered communities. The seed groups deﬁne the lower bounds, and they may in turn be deﬁned by network structure or node and edge attributes. In the experiments presented in this paper, we consider every maximal k-clique in the network to be the core c Springer International Publishing Switzerland 2015 A. Appice et al. (Eds.): ECML PKDD 2015, Part II, LNAI 9285, pp. 625–640, 2015. DOI: 10.1007/978-3-319-23525-7 38

626

M. Revelle et al.

of a partially deﬁned community. The upper bounds provide an intuitive way to incorporate knowledge about the degree of clustering in the network. Nodes may be members of multiple communities and communities may overlap. Communities are deﬁned by the associated distribution (topic) and a set of member nodes. Every seed group corresponds to a community, and the initial feature distributions are a weighted average of the seed members’ attributes. We use the features of nodes in each group to compute initial estimates for the

Data Loading...

Finding Community Topics and Membership in Graphs

Recommend Documents

Topics in Domination in Graphs

Hot topics provide value to MRS membership and the materials community

Finding Dense Subgraphs in Relational Graphs

Finding a Dense-Core in Jellyfish Graphs

Investigating Large-Scale Graphs for Community Detection

Efficient Closest Community Search over Large Graphs

Fast and Accurate Community Search Algorithm for Attributed Graphs

SpEC: Sparse Embedding-Based Community Detection in Attributed Graphs

Membership Functions

Finding community and overcoming barriers: experiences of queer and transgender postsecondary students in mathematics an

Membership Communities

Approximate Membership