Integrating Apache Impala into DataSophon

Note: this installation was done on Anolis 8.9 (CentOS 7 and CentOS 8 should behave the same).

The DataSophon version is DDP-1.2.1.

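I have uploaded the integrated installation package to Baidu Netdisk:

Files shared via Netdisk: impala-4.4.1.tar.gz and one other file (2 files in total)
Link: https://pan.baidu.com/s/18KfkO_BEFa5gVcc16I-Yew?pwd=za4k Extraction code: za4k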

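1. For Apache Impala I chose 4.4.1, the latest version on GitHub at the time of writing.

On GitHub, Impala is available as both rpm and deb packages. Building from source pulls in Python and C++ dependencies that are hard to download, so I used the rpm package provided on GitHub. (Note: the rpm package does not include the shell directory, while the deb package does. If you use the rpm, you need to copy the shell folder in manually.)

2. First download apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm

Move the rpm package to /opt, then install it with yum.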

wget https://github.com/apache/impala/releases/download/4.4.1/apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm
mv apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm /opt
# Install from /opt, since the rpm was just moved there
yum install -y /opt/apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm

Remember to copy the shell folder into the installation directory once the install finishes.

3. The install creates /opt/impala; rename it to /opt/impala-4.4.1

cd /opt
mv impala impala-4.4.1

4. Edit conf/impala-env.sh and change the following three values

: ${JAVA_HOME:=/usr/local/jdk}
# Specify extra CLASSPATH.
: ${CLASSPATH:=${IMPALA_HOME}/conf/:${IMPALA_HOME}/lib/jars/*}
# Specify extra LD_LIBRARY_PATH.
: ${LD_LIBRARY_PATH:=${IMPALA_HOME}/lib/native/:${JAVA_HOME}/jre/lib/amd64/server/}
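The `: ${VAR:=default}` form used in impala-env.sh only assigns the default when the variable is unset or empty, so a value exported before the script runs takes precedence. A minimal sketch of that behaviour, using a made-up variable name (DEMO_JAVA_HOME is hypothetical and not part of Impala):

```shell
# DEMO_JAVA_HOME is a hypothetical name used only for this illustration.
unset DEMO_JAVA_HOME
: "${DEMO_JAVA_HOME:=/usr/local/jdk}"   # unset, so the default is assigned
first="$DEMO_JAVA_HOME"

DEMO_JAVA_HOME=/opt/custom-jdk
: "${DEMO_JAVA_HOME:=/usr/local/jdk}"   # already set, so the value is kept
second="$DEMO_JAVA_HOME"

echo "$first"    # prints /usr/local/jdk
echo "$second"   # prints /opt/custom-jdk
```

In practice this means that if JAVA_HOME is already exported on the node, editing the default in impala-env.sh has no effect for that environment.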

5. Copy Hadoop's core-site.xml and hdfs-site.xml, and Hive's hive-site.xml, into conf

My hive-site.xml contains only the following

<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>hive.metastore.warehouse.dir</name>
    <value>/user/hive/warehouse</value>
  </property>
  <property>
    <name>hive.metastore.port</name>
    <value>9083</value>
  </property>
  <property>
    <name>hive.metastore.uris</name>
    <value>thrift://node01:9083</value>
  </property>
  <property>
    <name>hive.metastore.event.db.notification.api.auth</name>
    <value>false</value>
  </property>
  <property>
    <name>hive.metastore.dml.events</name>
    <value>true</value>
  </property>
  <property>
    <name>hive.metastore.transactional.event.listeners</name>
    <value>org.apache.hive.hcatalog.listener.DbNotificationListener</value>
  </property>
</configuration>
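Since Impala's catalogd talks to the Hive Metastore at the address in hive.metastore.uris, it can be worth checking that the endpoint is reachable from the Impala nodes before installing. The following is only a sketch: the function names are made up for this example, and the connectivity check assumes bash and the coreutils `timeout` command are available.

```shell
# Split a thrift://host:port URI into "host port".
# (metastore_endpoint and check_metastore are hypothetical helper names.)
metastore_endpoint() {
  hostport="${1#thrift://}"
  echo "${hostport%%:*} ${hostport##*:}"
}

# Return 0 if a TCP connection to the endpoint succeeds within 3 seconds.
# Relies on bash's /dev/tcp redirection, so bash must be installed.
check_metastore() {
  set -- $(metastore_endpoint "$1")
  timeout 3 bash -c "exec 3<>/dev/tcp/$1/$2" 2>/dev/null
}
```

For the configuration above, `check_metastore thrift://node01:9083 && echo reachable` would report whether the metastore port accepts connections.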

6. Package the directory as a tar.gz, copy it to the DDP/packages directory, and generate the md5 file

cd /opt
# Create the tar.gz archive
tar -zcvf impala-4.4.1.tar.gz impala-4.4.1
# Copy the archive to the DDP/packages directory
cp impala-4.4.1.tar.gz /opt/datasophon/DDP/packages
cd /opt/datasophon/DDP/packages
# Generate the md5 file
java -jar file-md5-1.0-SNAPSHOT-jar-with-dependencies.jar impala-4.4.1.tar.gz
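As a sanity check after generating the md5 file, the checksum can be compared against what `md5sum` computes. This sketch assumes the jar writes a file named `<archive>.md5` whose first whitespace-separated field is the hex digest; that naming and layout are assumptions, so adjust them if your output differs. `verify_md5` is a hypothetical helper name.

```shell
# verify_md5 ARCHIVE
# Returns 0 when the digest stored in ARCHIVE.md5 matches md5sum's result.
# Assumption: ARCHIVE.md5 holds the hex digest as its first field.
verify_md5() {
  archive="$1"
  expected="$(awk '{print $1; exit}' "${archive}.md5")"
  actual="$(md5sum "$archive" | awk '{print $1}')"
  [ -n "$expected" ] && [ "$expected" = "$actual" ]
}
```

Usage, run from the packages directory: `verify_md5 impala-4.4.1.tar.gz && echo "md5 matches"`.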

7. Write the service_ddl.json configuration file for IMPALA

cd /opt/datasophon/datasophon-manager-1.2.1/conf/meta/DDP-1.2.1/
mkdir IMPALA
cd IMPALA
# Create the json file and write the following content into it
vi service_ddl.json
{
  "name": "IMPALA",
  "label": "Impala",
  "description": "MPP (massively parallel processing) SQL query engine",
  "version": "4.4.1",
  "sortNum": 22,
  "dependencies": ["HDFS", "HIVE"],
  "packageName": "impala-4.4.1.tar.gz",
  "decompressPackageName": "impala-4.4.1",
  "roles": [
    {
      "name": "StateStored",
      "label": "StateStored",
      "roleType": "master",
      "runAs": {"user": "impala", "group": "hadoop"},
      "cardinality": "1+",
      "sortNum": 1,
      "logFile": "/var/log/impala/statestored.INFO",
      "jmxPort": 2191,
      "startRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["start", "statestored"]},
      "stopRunner": {"timeout": "600", "program": "bin/impala.sh", "args": ["stop", "statestored"]},
      "statusRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["status", "statestored"]},
      "restartRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["restart", "statestored"]},
      "externalLink": {"name": "StateStored Ui", "label": "StateStored Ui", "url": "http://${host}:25010"}
    },
    {
      "name": "Catalogd",
      "label": "Catalogd",
      "roleType": "master",
      "runAs": {"user": "impala", "group": "hadoop"},
      "cardinality": "1+",
      "sortNum": 2,
      "logFile": "/var/log/impala/catalogd.INFO",
      "jmxPort": 2191,
      "startRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["start", "catalogd"]},
      "stopRunner": {"timeout": "600", "program": "bin/impala.sh", "args": ["stop", "catalogd"]},
      "statusRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["status", "catalogd"]},
      "restartRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["restart", "catalogd"]},
      "externalLink": {"name": "Catalogd Ui", "label": "Catalogd Ui", "url": "http://${host}:25020"}
    },
    {
      "name": "Impalad",
      "label": "Impalad",
      "roleType": "worker",
      "runAs": {"user": "impala", "group": "hadoop"},
      "cardinality": "1+",
      "sortNum": 3,
      "logFile": "/var/log/impala/impalad.INFO",
      "jmxPort": 2191,
      "startRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["start", "impalad", "--enable_legacy_avx_support"]},
      "stopRunner": {"timeout": "600", "program": "bin/impala.sh", "args": ["stop", "impalad"]},
      "statusRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["status", "impalad"]},
      "restartRunner": {"timeout": "60", "program": "bin/impala.sh", "args": ["restart", "impalad", "--enable_legacy_avx_support"]}
    }
  ],
  "configWriter": {
    "generators": [
      {
        "filename": "statestored_flags",
        "configFormat": "properties",
        "outputDirectory": "conf",
        "includeParams": ["-hostname", "-log_dir", "-minidump_path", "custom.statestored_flags"]
      },
      {
        "filename": "catalogd_flags",
        "configFormat": "properties",
        "outputDirectory": "conf",
        "includeParams": ["-hostname", "-state_store_host", "-log_dir", "-minidump_path", "custom.catalogd_flags"]
      },
      {
        "filename": "impalad_flags",
        "configFormat": "properties",
        "outputDirectory": "conf",
        "includeParams": ["-hostname", "-state_store_host", "-catalog_service_host", "-log_dir", "-minidump_path", "-mem_limit", "custom.impalad_flags"]
      }
    ]
  },
  "parameters": [
    {
      "name": "-hostname",
      "label": "impalad node IP",
      "description": "impalad node IP",
      "required": true,
      "type": "input",
      "value": "${host}",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "${host}"
    },
    {
      "name": "-catalog_service_host",
      "label": "catalog_service_host node IP",
      "description": "catalog_service_host node IP",
      "required": true,
      "type": "input",
      "value": "node01",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "${host}"
    },
    {
      "name": "-state_store_host",
      "label": "statestore node IP",
      "description": "statestore node IP",
      "required": true,
      "type": "input",
      "value": "node01",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "${host}"
    },
    {
      "name": "-log_dir",
      "label": "log_dir log path",
      "description": "log_dir log path",
      "required": true,
      "type": "input",
      "value": "/var/log/impala",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "/var/log/impala"
    },
    {
      "name": "-minidump_path",
      "label": "minidump_path path",
      "description": "minidump_path path",
      "required": true,
      "type": "input",
      "value": "/var/log/impala/minidumps",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "/var/log/impala/minidumps"
    },
    {
      "name": "-mem_limit",
      "label": "mem_limit",
      "description": "mem_limit",
      "required": true,
      "type": "input",
      "value": "80%",
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": "80%"
    },
    {
      "name": "custom.statestored_flags",
      "label": "Custom configuration statestored_flags",
      "description": "Custom configuration",
      "configType": "custom",
      "required": true,
      "type": "multipleWithKey",
      "value": [{"-v": "1"}, {"-log_filename": "statestored"}, {"-max_log_files": "10"}, {"-max_log_size": "200"}],
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": ""
    },
    {
      "name": "custom.catalogd_flags",
      "label": "Custom configuration catalogd_flags",
      "description": "Custom configuration",
      "configType": "custom",
      "required": true,
      "type": "multipleWithKey",
      "value": [{"-v": "1"}, {"-log_filename": "catalogd"}, {"-max_log_files": "10"}, {"-max_log_size": "200"}],
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": ""
    },
    {
      "name": "custom.impalad_flags",
      "label": "Custom configuration impalad_flags",
      "description": "Custom configuration",
      "configType": "custom",
      "required": true,
      "type": "multipleWithKey",
      "value": [{"-v": "1"}, {"-log_filename": "impalad"}, {"-max_log_files": "10"}, {"-max_log_size": "200"}, {"-scratch_dirs": "/data/impala/impalad"}],
      "configurableInWizard": true,
      "hidden": false,
      "defaultValue": ""
    }
  ]
}
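A file this large is easy to break with a stray comma or brace, so it may help to confirm that it parses as JSON before restarting the API service. A small sketch, assuming python3 is installed on the manager node (`validate_ddl` is a hypothetical helper name, and this checks syntax only, not whether DataSophon accepts the fields):

```shell
# validate_ddl FILE
# Returns 0 when FILE parses as JSON, non-zero otherwise.
# Uses the json.tool module from Python's standard library.
validate_ddl() {
  python3 -m json.tool "$1" > /dev/null
}
```

Usage: `validate_ddl service_ddl.json && echo "JSON syntax OK"`.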

8. Restart the DataSophon api service

cd /opt/datasophon/datasophon-manager-1.2.1/
Start: sh bin/datasophon-api.sh start api
Stop: sh bin/datasophon-api.sh stop api
Restart: sh bin/datasophon-api.sh restart api

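9. Back in the web UI, you can now install the service. (During installation, remember to change the IP or hostname of the servers where catalogd and statestored run.)

Because I set the log directory to /var/log/impala and the impala user does not have permission to create it, I had to create the directory manually, which is the fix I know of. Do this before installing the service.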

mkdir -p /var/log/impala/minidumps
chmod 777 /var/log/impala
chmod 777 /var/log/impala/minidumps
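As an alternative to world-writable permissions, the directories could be handed to the account that runs the roles. The sketch below assumes the impala user and hadoop group from the runAs entries in service_ddl.json already exist on the node and that it is run as root; `prepare_dirs` is a hypothetical helper name, and I have not verified this variant against a DataSophon install.

```shell
# prepare_dirs OWNER GROUP DIR...
# Creates each directory (with parents), assigns ownership, and sets mode 755.
prepare_dirs() {
  owner="$1"
  group="$2"
  shift 2
  for dir in "$@"; do
    mkdir -p "$dir" || return 1
    chown "$owner:$group" "$dir" || return 1
    chmod 755 "$dir" || return 1
  done
}
```

Usage on each node that will host an Impala role: `prepare_dirs impala hadoop /var/log/impala /var/log/impala/minidumps`. The scratch directory from the impalad flags, /data/impala/impalad, could be passed the same way.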

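If you would rather build and package Impala yourself, the following posts may help:

Compiling and deploying apache-impala | 子崖说

Integrating impala with Datasophon | 子崖说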

https://zhuanlan.zhihu.com/p/348344999
