手记

大数据平台一键搭建

想不想快速拥有大数据环境?如何才能一键安装?傻瓜式引导?这里就有你想要的,我们可以使用hortonworks出品的sandbox-hdp来搭建,它是建立在docker环境之上的集群,很轻易的主可以把环境搭建起来。

Big-Data.png

使用方法

这里我会使用HDP的最新版本3.0.1作为演示,包含的组件版本如下

组件版本
HDFS3.1.1
YARN3.1.1
MapReduce23.1.1
Tez0.9.1
Hive3.1.0
HBase2.0.0
Pig0.16.0
Sqoop1.4.7
Oozie4.3.1
ZooKeeper3.4.6
Storm1.2.1
Infra Solr0.1.0
Atlas1.0.0
Kafka1.1.1
Knox1.0.0
Ranger1.1.0
Spark22.3.1
Zeppelin Notebook0.8.0
Data Analytics Studio1.0.2.0.0
Druid1.12.1
Superset0.23.0

安装步骤

先下载docker镜像(打个预防针:大小占 26G 慢慢下吧哈,但收获很大)

docker pull hortonworks/sandbox-hdp:3.0.1
docker pull hortonworks/sandbox-proxy:1.0

下载启动配置脚本

git clone https://github.com/dounine/sandbox-hdp-3.0.1.git

添加hosts映射

127.0.0.1 sandbox-hdp.hortonworks.com# 或者是使用公网IP或局域网IP

启动配置

./docker-deploy-hdp265.sh

成功结果

root@lake /s/d/sandbox-hdp-3.0.1# ./docker-deploy-hdp265.sh + registry=hortonworks                                                                                                                                                 
+ name=sandbox-hdp                                                                                                                                                     
+ version=3.0.1                                                                                                                                                        + proxyName=sandbox-proxy                                                                                                                                              
+ proxyVersion=1.0                                                                                                                                                     + flavor=hdp                                                                                                                                                           
+ echo hdp                                                                                                                                                             
+ mkdir -p sandbox/proxy/conf.d                                                                                                                                        
+ mkdir -p sandbox/proxy/conf.stream.d                                                                                                                                 
+ docker pull hortonworks/sandbox-hdp:3.0.1                                                                                                                            3.0.1: Pulling from hortonworks/sandbox-hdp                                                                                                                            
Digest: sha256:7b767af7b42030fb1dd0f672b801199241e6bef1258e3ce57361edb779d95921                                                                                        
Status: Image is up to date for hortonworks/sandbox-hdp:3.0.1                                                                                                          + docker pull hortonworks/sandbox-proxy:1.0                                                                                                                            1.0: Pulling from hortonworks/sandbox-proxy                                                                                                                            
Digest: sha256:42e4cfbcbb76af07e5d8f47a183a0d4105e65a1e7ef39fe37ab746e8b2523e9e                                                                                        
Status: Image is up to date for hortonworks/sandbox-proxy:1.0                                                                                                          + '[' hdp == hdf ']'                                                                                                                                                   + '[' hdp == hdp ']'                                                                                                                                                   + hostname=sandbox-hdp.hortonworks.com                                                                                                                                 
++ docker images                                                                                                                                                       
++ grep hortonworks/sandbox-hdp                                                                                                                                        
++ awk '{print $2}'                                                                                                                                                    + version=3.0.1                                                                                                                                                        + docker network create cda                                                                                                                                            
+ docker run --privileged --name sandbox-hdp -h sandbox-hdp.hortonworks.com --network=cda --network-alias=sandbox-hdp.hortonworks.com -d hortonworks/sandbox-hdp:3.0.1 46bf6b414dd3c0fb36a3816eac129219d30d49ea9421898158800e0ab3576048                                                                                                       
+ echo ' Remove existing postgres run files. Please wait'                                                                                                              
 Remove existing postgres run files. Please wait                                                                                                                       
+ sleep 2                                                                                                                                                              + docker exec -t sandbox-hdp sh -c 'rm -rf /var/run/postgresql/*; systemctl restart postgresql;'                                                                       Failed to restart postgresql.service: Unit not found.                                                                                                                  
+ sed s/sandbox-hdp-security/sandbox-hdp/g assets/generate-proxy-deploy-script.sh                                                                                      
+ mv -f assets/generate-proxy-deploy-script.sh.new assets/generate-proxy-deploy-script.sh                                                                              
+ chmod +x assets/generate-proxy-deploy-script.sh                                                                                                                      
+ assets/generate-proxy-deploy-script.sh                                                                                                                               
+ uname                                                                                                                                                                
+ grep MINGW                                                                                                                                                           
+ chmod +x sandbox/proxy/proxy-deploy.sh                                                                                                                               
+ sandbox/proxy/proxy-deploy.sh                                                                                                                                        
7fa5c4d0737a6b71796fe997baf397d4078907d83fcfaa2a8c0f241772547147

需要先重置一下ambari的密码才能登录

docker exec -ti sandbox-hdp bash
ambari-admin-password-reset #重置密码登录

结果

[root@sandbox-hdp /]# ambari-admin-password-reset                                                                                                                      Please set the password for admin:                                                                                                                                     
Please retype the password for admin:                                                                                                                                  
                                                                                                                                                                       
The admin password has been set.                                                                                                                                       
Restarting ambari-server to make the password change effective...                                                                                                      
                                                                                                                                                                       
Using python  /usr/bin/python                                                                                                                                          
Restarting ambari-server                                                                                                                                               
Waiting for server stop...                                                                                                                                             
Ambari Server stopped                                                                                                                                                  
Ambari Server running with administrator privileges.                                                                                                                   
Organizing resource files at /var/lib/ambari-server/resources...                                                                                                       
Ambari database consistency check started...                                                                                                                           
Server PID at: /var/run/ambari-server/ambari-server.pid                                                                                                                
Server out at: /var/log/ambari-server/ambari-server.out                                                                                                                
Server log at: /var/log/ambari-server/ambari-server.log                                                                                                                Waiting for server start...................                                                                                                                            
Server started listening on 8080                                                                                                                                       
                                                                                                                                                                       DB configs consistency check: no errors and warnings were found.

输入帐号密码登录 ambar webUI http://localhost:8080

Ambari界面


PS:后续还有其它相关的大数据文章



作者:dounine
链接:https://www.jianshu.com/p/6af5301e9780


0人推荐
随时随地看视频
慕课网APP