Record:   Prev Next
作者 Srinivasa, K.G., author
書名 Guide to high performance distributed computing : case studies with Hadoop, Scalding and Spark / by K.G. Srinivasa, Anil Kumar Muppalla
出版項 Cham : Springer International Publishing : Imprint: Springer, 2015
國際標準書號 9783319134970 (electronic bk.)
9783319134963 (paper)
國際標準號碼 10.1007/978-3-319-13497-0 doi
book jacket
說明 1 online resource (xvii, 304 pages) : illustrations, digital ; 24 cm
text txt rdacontent
computer c rdamedia
online resource cr rdacarrier
text file PDF rda
系列 Computer communications and networks, 1617-7975
Computer communications and networks
附註 Part I: Programming Fundamentals of High Performance Distributed Computing -- Introduction -- Getting Started with Hadoop -- Getting Started with Spark -- Programming Internals of Scalding and Spark -- Part II: Case studies using Hadoop, Scalding and Spark -- Case Study I: Data Clustering using Scalding and Spark -- Case Study II: Data Classification using Scalding and Spark -- Case Study III: Regression Analysis using Scalding and Spark -- Case Study IV: Recommender System using Scalding and Spark
This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies such as Hadoop, Scalding and Spark. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Topics and features: Describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing Presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding Provides detailed case studies on approaches to clustering, data classification and regression analysis Explains the process of creating a working recommender system using Scalding and Spark Supplies a complete list of supplementary source code and datasets at an associated website Fulfilling the need for both introductory material for undergraduate students of computer science and detailed discussions for software engineering professionals, this book will aid a broad audience to understand the esoteric aspects of practical high performance computing through its use of solved problems, research case studies and working source code. K.G. Srinivasa is Professor and Head of the Department of Computer Science and Engineering at M.S. Ramaiah Institute of Technology (MSRIT), Bangalore, India. His other publications include the Springer title Soft Computing for Data Mining Applications. Anil Kumar Muppalla is also a researcher at MSRIT
Host Item Springer eBooks
主題 Apache Hadoop
SPARK (Electronic resource)
High performance computing -- Case studies
Electronic data processing -- Distributed processing -- Case studies
Computer Science
Computer Communication Networks
Programming Techniques
Data Mining and Knowledge Discovery
Artificial Intelligence (incl. Robotics)
Image Processing and Computer Vision
Alt Author Muppalla, Anil Kumar, author
SpringerLink (Online service)
Record:   Prev Next