社区
Spark
帖子详情
中文版前4章,有需要的朋友拿去
野男孩
2016-08-01 09:04:41
本站的资源里直接搜就有。
懒得搜的就从下面传送门去我的blog,里面有链接:
鼠标点我哈:<Apache Spark Graph Processing>中文版前4章
...全文
435
回复
打赏
收藏
<Apache Spark Graph Processing>中文版前4章,有需要的朋友拿去
本站的资源里直接搜就有。 懒得搜的就从下面传送门去我的blog,里面有链接: 鼠标点我哈:中文版前4章
复制链接
扫一扫
分享
转发到动态
举报
AI
作业
写回复
配置赞助广告
用AI写文章
回复
切换为时间正序
请发表友善的回复…
发表回复
打赏红包
Apache
Spark
Graph
Processing
. pdf
Apache
Spark
Graph
Processing
. pdf 是用于
Spark
图计算的书,高大上必备
Apache
Spark
Graph
Processing
(PACKT,2015)
Apache
Spark
is the next standard of open-source cluster-computing engine for
processing
big data. Many practical computing problems concern large
graph
s, like the Web
graph
and various social networks. The scale of these
graph
s – in some cases billions of vertices, trillions of edges – poses challenges to their efficient
processing
.
Apache
Spark
Graph
X API combines the advantages of both data-parallel and
graph
-parallel systems by efficiently expressing
graph
computation within the
Spark
data-parallel framework. This book will teach the user to do
graph
ical programming in
Apache
Spark
, apart from an explanation of the entire process of
graph
ical data analysis. You will journey through the creation of
graph
s, its uses, its exploration and analysis and finally will also cover the conversion of
graph
elements into
graph
structures. This book begins with an introduction of the
Spark
system, its libraries and the Scala Build Tool. Using a hands-on approach, this book will quickly teach you how to install and leverage
Spark
interactively on the command line and in a standalone Scala program. Then, it presents all the methods for building
Spark
graph
s using illustrative network datasets. Next, it will walk you through the process of exploring, visualizing and analyzing different network characteristics. This book will also teach you how to transform raw datasets into a usable form. In addition, you will learn powerful operations that can be used to transform
graph
elements and
graph
structures. Furthermore, this book also teaches how to create custom
graph
operations that are tailored for specific needs with efficiency in mind. The later chapters of this book cover more advanced topics such as clustering
graph
s, implementing
graph
-parallel iterative algorithms and learning methods from
graph
data.
Apache
Spark
2 for Beginners [2016]
Apache
Spark
2.0 for Beginners English | ISBN: 1785885006 | 2016 | Key Features This book offers an easy introduction to the
Spark
framework published on the latest version of
Apache
Spark
2 Perform efficient data
processing
, machine learning and
graph
processing
using various
Spark
components A practical guide aimed at beginners to get them up and running with
Spark
Book Description
Spark
is one of the most widely-used large-scale data
processing
engines and runs extremely fast. It is a framework that has tools that are equally useful for application developers as well as data scientists. This book starts with the fundamentals of
Spark
2 and covers the core data
processing
framework and API, installation, and application development setup. Then the
Spark
programming model is introduced through real-world examples followed by
Spark
SQL programming with DataFrames. An introduction to
Spark
R is covered next. Later, we cover the charting and plotting features of Python in conjunction with
Spark
data
processing
. After that, we take a look at
Spark
's stream
processing
, machine learning, and
graph
processing
libraries. The last chapter combines all the skills you learned from the preceding chapters to develop a real-world
Spark
application. By the end of this book, you will have all the knowledge you need to develop efficient large-scale applications using
Apache
Spark
. What you will learn Get to know the fundamentals of
Spark
2 and the
Spark
programming model using Scala and Python Know how to use
Spark
SQL and DataFrames using Scala and Python Get an introduction to
Spark
programming using R Perform
Spark
data
processing
, charting, and plotting using Python Get acquainted with
Spark
stream
processing
using Scala and Python Be introduced to machine learning using
Spark
MLlib Get started with
graph
processing
using the
Spark
Graph
X Bring together all that you've learned and develop a complete
Spark
application
Apache
_
Spark
_
Graph
_
Processing
Apache
_
Spark
_
Graph
_
Processing
原版英文书
Mastering.
Apache
.
Spark
.178397146
About This Book Explore the integration of
Apache
Spark
with third party applications such as H20, Databricks and Titan Evaluate how Cassandra and Hbase can be used for storage An advanced guide with a combination of instructions and practical examples to extend the most up-to date
Spark
functionalities Who This Book Is For If you are a developer with some experience with
Spark
and want to strengthen your knowledge of how to get around in the world of
Spark
, then this book is ideal for you. Basic knowledge of Linux, Hadoop and
Spark
is assumed. Reasonable knowledge of Scala is expected. What You Will Learn Extend the tools available for
processing
and storage Examine clustering and classification using MLlib Discover
Spark
stream
processing
via Flume, HDFS Create a schema in
Spark
SQL, and learn how a
Spark
schema can be populated with data Study
Spark
based
graph
processing
using
Spark
Graph
X Combine
Spark
with H20 and deep learning and learn why it is useful Evaluate how
graph
storage works with
Apache
Spark
, Titan, HBase and Cassandra Use
Apache
Spark
in the cloud with Databricks and AWS In Detail
Apache
Spark
is an in-memory cluster based parallel
processing
system that provides a wide range of functionality like
graph
processing
, machine learning, stream
processing
and SQL. It operates at unprecedented speeds, is easy to use and offers a rich set of data transformations. This book aims to take your limited knowledge of
Spark
to the next level by teaching you how to expand
Spark
functionality. The book commences with an overview of the
Spark
eco-system. You will learn how to use MLlib to create a fully working neural net for handwriting recognition. You will then discover how stream
processing
can be tuned for optimal performance and to ensure parallel
processing
. The book extends to show how to incorporate H20 for machine learning, Titan for
graph
based storage, Databricks for cloud-based
Spark
. Intermediate Scala based code examples are provided for
Apache
Spark
module
processing
in a CentOS Linux and Databricks cloud environment. Table of Contents Chapter 1:
Apache
Spark
Chapter 2:
Apache
Spark
Mllib Chapter 3:
Apache
Spark
Streaming Chapter 4:
Apache
Spark
Sql Chapter 5:
Apache
Spark
Graph
x Chapter 6:
Graph
-Based Storage Chapter 7: Extending
Spark
With H2O Chapter 8:
Spark
Databricks Chapter 9: Databricks Visualization
Spark
1,273
社区成员
1,171
社区内容
发帖
与我相关
我的任务
Spark
Spark由Scala写成,是UC Berkeley AMP lab所开源的类Hadoop MapReduce的通用的并行计算框架,Spark基于MapReduce算法实现的分布式计算。
复制链接
扫一扫
分享
社区描述
Spark由Scala写成,是UC Berkeley AMP lab所开源的类Hadoop MapReduce的通用的并行计算框架,Spark基于MapReduce算法实现的分布式计算。
社区管理员
加入社区
获取链接或二维码
近7日
近30日
至今
加载中
查看更多榜单
社区公告
暂无公告
试试用AI创作助手写篇文章吧
+ 用AI写文章