Large-Scale Deployment of Virtualized Big Data

IT at the Speed of Business*

* Source: Gartner, 2013: “Hunting and Harvesting in a Digital World: The 2013 CIO Agenda”

IT Technology Eras
• Mainframe
• Client-Server
• Mobile-Cloud
Chart labels: Steady IT Budgets*; Business Expectations; Ability of IT to Deliver

Software-Defined Data Center
Agility, Efficiency, Control and Choice in the Mobile-Cloud Era
All infrastructure is virtualized and delivered as a service, enabling comprehensive datacenter management through software and extensibility to cloud resources.

Unified Platform for Any Application
Scale-up, Scale-out, and Everything in Between

vSphere 6: Great for Big Data
• 30 TB TeraSort benchmark (32-host cluster, 4 VMs per host)
• Virtualized Hadoop up to 12% faster¹ than running bare-metal

¹ Based on VMware internal tests, Jan 2015

vSphere 6.0 Tests – Hardware Layout

Virtualizing Big Data: Value Proposition
Agility
• Infrastructure on demand
• Rapid scaling of clusters

Simplified Management
• Centralized data center management
• Automated configuration and deployment of big data workloads with best practices

Efficiency
• Resource pooling
• Server and cluster consolidation

Performance
• Up to 12% better performance
• No significant overhead

vSphere + Big Data Extensions

Agility
Physical
Timeframe: Weeks
• Order hardware (servers, storage, etc.)
• Wait for them to arrive
• Setup
• Server preparation
• OS installation
• Disk configuration
• Network configuration
• Hadoop installation and configuration

Virtual
Timeframe: Minutes
• Spin up a new virtual machine
• Add the virtual machine to the cluster

Hadoop Cluster Deployment on VMware
On physical machines:
• Server Preparation
• OS Installation
• Network Configuration
• Hadoop Installation and Configuration
On VMware:
• Big Data Extensions for VM creation, configuration, start-up
• Big Data Extensions or other Hadoop management tool

Simplified Management
• Familiar vSphere graphical user interface
• Set up a flexible configuration and BDE will connect the dots behind the scenes

BDE Allows Flexible Configurations
• Storage configuration: choice of shared or local
• High Availability option
• Number of nodes and resource configuration
• VM placement policies

Enable Hadoop as a Service On-Prem
• vCenter Plugin
• vCloud Automation Center
• Hadoop as a Service

Efficiency
• Better utilization of servers
• Efficient and rapid scaling of clusters through VM cloning
• Best practices for VM placement onto servers

Increase Utilization to Control Costs
Clusters shown: Hadoop 1, Hadoop 2, HBase
• Consolidated cluster has access to the entire pool of physical resources
• Take advantage of multi-tenancy to increase utilization during non-peak hours
• Reduce latency on priority jobs on the consolidated cluster

Integrated Solution for Big Data Infrastructure Management
• Components: vCenter Operations Manager, vCloud Automation Center, Big Data Extensions, vSphere vCenter, ESXi cluster
• Capabilities: elasticity, configuration, isolation, availability, deployment, multi-tenancy

How BDE Works
• BDE is a downloadable virtual appliance integrated as a plug-in to vCenter Server.
• BDE requires vSphere 5.0 or later with an Enterprise or Enterprise Plus license.
• BDE clones VMs from the template and controls and configures the VMs through vCenter.
Diagram labels: Host (five shown); Virtualization Platform; Hadoop Node; vCenter; Management Server; Template; Virtual Appliance

Deployment Options with Big Data Extensions
• BDE only: BDE provisions VMs and installs the Hadoop software.
• BDE 2.0: BDE provisions empty VMs; the Hadoop management tool installs the software.
• BDE 2.1: BDE provisions VMs and calls the management tool API; the Hadoop management tool installs the software under the hood.

Skyscape Cloud Services
• A UK company that provides cloud computing services to the UK Government’s G-Cloud initiative.
• Skyscape offers IaaS, PaaS, and SaaS.
• 5 customers lined up on the first day of GA.
• Expected to expand to 140 servers very soon.
• Skyscape Hadoop in the Cloud is built on top of BDE.
• Used our API quite independently.
• Public reference with case study

Adobe Case Study
• Digital Marketing business unit
• Hadoop as a Service using the vRA solution
• Goal is to bring Hadoop on AWS in-house
• Now in production
• Currently several BDE clusters running in Oregon
• Hundreds of nodes across 6 data centers by VMworld
• Performance meeting and exceeding expectations
• Public reference with case study

VMware IT
This Big Data Cluster is fully virtualized:
• Based on vSphere 6.0 and VMware Big Data Extensions 2.2
• EMC Isilon 7.2.0.2 with two patches for HDFS storage
• Pivotal Big Data Suite 3.0 for Hadoop 2.6 and HAWQ 1.3
• Pivotal Spring XD 1.2 for data ingestion to Hadoop
• Alpine Data Lab 5.4 for running
   • Deeper analytic functions
   • Machine learning algorithms
• HUE 2.6 as the GUI-based Hive/Pig query execution client

Take Away …
At VMware IT, we have established that an enterprise big data analytics platform can be successfully built and run on top of VMware virtual infrastructure with EMC Isilon and PHD 3.0, with great performance.

Hybrid storage model - the best of both worlds
– Master nodes (NameNode, ResourceManager, ZooKeeper, etc.) on shared storage
   • Leverage vSphere vMotion, HA and FT
– Worker nodes
   • NodeManager/DataNode on local storage
   • Lower cost, scalable bandwidth
   • Temp data is written to local storage for best performance
   • Scale-out NAS storage for HDFS data is a good alternative to local storage
Storage options shown: Shared Storage; Local Storage; NAS (Isilon) for HDFS

Combined Model – Two Virtual Machines on a Host
Diagram labels: Virtualization Host; Hadoop Virtual Node 1 (DataNode, NodeManager); Hadoop Virtual Node 2 (DataNode, NodeManager); Ext4 on VMDK data disks; OS Image – VMDK (one per node); Shared storage (SAN/NAS); Local disks

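The hybrid storage guidance above (master roles on shared storage, worker roles on local disks) can be read as a simple role-to-storage rule. The sketch below is only an illustration of that rule: the `Storage`, `Placement`, and `place` names are hypothetical and are not part of BDE or vSphere, and the `uses_vsphere_ha` flag simply records whether the slide pairs a role with vMotion, HA and FT.

```python
from dataclasses import dataclass
from enum import Enum


class Storage(Enum):
    SHARED = "shared"  # SAN/NAS datastore
    LOCAL = "local"    # disks attached to one host


# Role names come from the slide; grouping them into two sets is this sketch's choice.
MASTER_ROLES = {"NameNode", "ResourceManager", "ZooKeeper"}
WORKER_ROLES = {"NodeManager", "DataNode"}


@dataclass(frozen=True)
class Placement:
    storage: Storage
    uses_vsphere_ha: bool  # True where the slide pairs the role with vMotion/HA/FT


def place(role: str) -> Placement:
    """Return the hybrid-model storage choice for one Hadoop role."""
    if role in MASTER_ROLES:
        return Placement(Storage.SHARED, uses_vsphere_ha=True)
    if role in WORKER_ROLES:
        return Placement(Storage.LOCAL, uses_vsphere_ha=False)
    raise ValueError(f"unknown Hadoop role: {role}")


if __name__ == "__main__":
    for role in sorted(MASTER_ROLES | WORKER_ROLES):
        p = place(role)
        print(f"{role:16} -> {p.storage.value:6} storage, vSphere HA: {p.uses_vsphere_ha}")
```

The NAS (Isilon) option for HDFS data is left out of the sketch; it would be a third storage kind for the DataNode role rather than a change to the rule itself.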
Separated Model with VMDK Isolation
Diagram labels: Virtualization Host; Hadoop Virtual Node 1 (NodeManager); Hadoop Virtual Node 2 (DataNode); Ext4 on VMDK data disks; OS Image – VMDK (one per node); Shared storage (SAN/NAS); Local disks for temp data (JBOD); Local disks for HDFS data (JBOD)

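The difference between the combined and separated models can be made concrete with a small data model. This is a sketch based on one reading of the diagram labels, assuming that in the separated model the NodeManager VM holds only temp data and the DataNode VM holds only HDFS data. `VirtualNode`, `combined_model`, `separated_model`, and `is_isolated` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VirtualNode:
    name: str
    roles: frozenset       # Hadoop daemons running in this VM
    data_kinds: frozenset  # what its Ext4-on-VMDK data disks hold: "hdfs", "temp"


def combined_model() -> list:
    """Two VMs per host; each runs DataNode and NodeManager side by side."""
    return [
        VirtualNode(f"hadoop-virtual-node-{i}",
                    frozenset({"DataNode", "NodeManager"}),
                    frozenset({"hdfs", "temp"}))
        for i in (1, 2)
    ]


def separated_model() -> list:
    """Two VMs per host; compute and storage daemons live in different VMs."""
    return [
        VirtualNode("hadoop-virtual-node-1", frozenset({"NodeManager"}), frozenset({"temp"})),
        VirtualNode("hadoop-virtual-node-2", frozenset({"DataNode"}), frozenset({"hdfs"})),
    ]


def is_isolated(nodes: list) -> bool:
    """True when no single VM holds both HDFS data and temp data."""
    return all(len(node.data_kinds) == 1 for node in nodes)


if __name__ == "__main__":
    for label, nodes in (("combined", combined_model()), ("separated", separated_model())):
        print(label, "-> VMDK isolation:", is_isolated(nodes))
        for node in nodes:
            print("  ", node.name, sorted(node.roles), sorted(node.data_kinds))
```

Keeping the two data kinds in different VMs is what lets compute VMs be added or removed without touching the VMs that hold HDFS data.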
Diagram labels: Tenant 1; Tenant 2; Compute; Storage; Combined Compute and Storage; VM; Unmodified Hadoop node in a VM
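
The choices listed on the "BDE Allows Flexible Configurations" slide (shared or local storage, a High Availability option, number of nodes, and resource configuration) can be pictured as a small cluster specification. The sketch below is not BDE's actual specification format: the `NodeGroup` fields, the `validate` helper, and the sample sizes are all assumptions made for illustration, and VM placement policies are omitted.

```python
from dataclasses import dataclass

STORAGE_CHOICES = {"shared", "local"}


@dataclass
class NodeGroup:
    name: str
    node_count: int
    vcpus: int
    memory_mb: int
    storage: str                     # "shared" or "local"
    high_availability: bool = False  # the slide's High Availability option


def validate(groups: list) -> list:
    """Return human-readable problems; an empty list means the spec looks sane."""
    problems = []
    for g in groups:
        if g.storage not in STORAGE_CHOICES:
            problems.append(f"{g.name}: storage must be one of {sorted(STORAGE_CHOICES)}")
        if g.node_count < 1:
            problems.append(f"{g.name}: node_count must be at least 1")
        if g.vcpus < 1 or g.memory_mb < 1:
            problems.append(f"{g.name}: vcpus and memory_mb must be positive")
    return problems


if __name__ == "__main__":
    # Sample values are made up for the example, not sizing guidance.
    spec = [
        NodeGroup("master", node_count=1, vcpus=4, memory_mb=16384,
                  storage="shared", high_availability=True),
        NodeGroup("worker", node_count=8, vcpus=4, memory_mb=16384, storage="local"),
    ]
    print(validate(spec) or "spec OK")
    print("total VMs:", sum(g.node_count for g in spec))
```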