Record:   Prev Next
Author Agarwal, Nitin
Title Modeling and data mining in blogosphere [electronic resource] / Nitin Agarwal, Huan Liu
Imprint San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool Publishers, c2009
book jacket
Descript 1 electronic text (x, 101 p. : ill.) : digital file
Series Synthesis lectures on data mining and knowledge discovery ; # 1
Synthesis lectures on data mining and knowledge discovery (Online) ; # 1
Note Part of: Synthesis digital library of engineering and computer science
Title from PDF t.p. (viewed on August 9, 2009)
Series from website
Includes bibliographical references (p. 87-93) and index
Modeling blogosphere -- Modeling essentials -- Preferential attachment blog models -- Log-normal distribution models -- Blog clustering and community discovery -- Graph based approach -- Content based approach -- Hybrid approach -- Influence and trust -- Influence -- Graph based approach -- Content based approach -- Hybrid approach -- Blog leaders -- Trust -- Trust computation -- Trust propagation -- Spam filtering in blogosphere -- Graph based approach -- Content based approach -- Hybrid approach -- Data collection and evaluation -- Data collection -- API -- Web crawler -- Available datasets -- Data preprocessing -- Evaluation -- Blog modeling -- Blog clustering and community discovery -- Influence and trust -- Spam -- Tools in blogosphere -- API examples
Abstract freely available; full-text restricted to subscribers or individual document purchasers
Google scholar
Google book search
Mode of access: World Wide Web
System requirements: Adobe Acrobat reader
This book offers a comprehensive overview of the various concepts and research issues about blogs or weblogs. It introduces techniques and approaches, tools and applications, and evaluation methodologies with examples and case studies. Blogs allow people to express their thoughts, voice their opinions, and share their experiences and ideas. Blogs also facilitate interactions among individuals creating a network with unique characteristics. Through the interactions individuals experience a sense of community.We elaborate on approaches that extract communities and cluster blogs based on information of the bloggers. Open standards and low barrier to publication in Blogosphere have transformed information consumers to producers, generating an overwhelming amount of ever-increasing knowledge about the members, their environment and symbiosis.We elaborate on approaches that sift through humongous blog data sources to identify influential and trustworthy bloggers leveraging content and network information. Spam blogs or splogs is an increasing concern in Blogosphere, which is discussed in detail with the approaches leveraging supervised machine learning algorithms and interaction patterns.We elaborate on data collection procedures, provide resources for blog data repositories, mention various visualization and analysis tools in Blogosphere, and explain conventional and novel evaluation methodologies, to help perform research in the Blogosphere. The book is supported by additional material, including lecture slides as well as the complete set of figures used in the book, and the reader is encouraged to visit the book website for the latest information:
Also available in print
Subject Blogs -- Mathematical models
Data mining
Blog model
Power law distribution
Scale free networks
Degree distribution
Clustering coefficient
Centrality measures
Community discovery
Spam blogs
Data collection
Blog crawling
Performance evaluation
Alt Author Liu, Huan, 1958-
Vari Title Synthesis digital library of engineering and computer science
Record:   Prev Next