Hive快速入门系列(9) | Hive表中数据的加载与导出-伙伴云

Hive 快速入门系列(9) | Hive表中数据的加载与导出

网友投稿 954 2022-05-29

本次博主为大家带来的是Hive表中数据的加载与导出。希望能够帮助到大家。

一. Hive表中加载数据

1.1 直接向分区表中插入数据

1.2 通过查询插入数据

1.3 多插入模式

1.4 查询语句中创建表并加载数据（as select）

1.5 创建表时通过location指定加载数据路径

二. Hive表中的数据导出（了解就行）

2.1 insert导出

2.2 Hadoop命令导出到本地

2.3 hive shell 命令导出

2.4 export导出到HDFS上(全表导出)

三. 清空表数据

一. Hive表中加载数据

1.1 直接向分区表中插入数据

create table score3 like score; insert into table score3 partition(month ='201807') values ('001','002','100');

1.2 通过查询插入数据

1. 通过load方式加载数据

(linux) load data local inpath ‘/export/servers/hivedatas/score.csv’ overwrite into table score partition(month=‘201806’); (HDFS) load data inpath ‘/export/servers/hivedatas/score.csv’ overwrite into table score partition(month=‘201806’);

2. 通过查询方式加载数据

create table score4 like score; insert overwrite table score4 partition(month = '201806') select s_id,c_id,s_score from score;

关键字overwrite 必须要有

1.3 多插入模式

常用于实际生产环境当中，将一张表拆开成两部分或者多部分

1. 给score表加载数据

load data local inpath '/export/servers/hivedatas/score.csv' overwrite into table score partition(month='201806');

2. 创建第一部分表：

create table score_first( s_id string,c_id string) partitioned by (month string) row format delimited fields terminated by '\t' ;

3. 创建第二部分表：

create table score_second(c_id string,s_score int) partitioned by (month string) row format delimited fields terminated by '\t';

4. 分别给第一部分与第二部分表加载数据

from score insert overwrite table score_first partition(month='201806') select s_id,c_id insert overwrite table score_second partition(month = '201806') select c_id,s_score;

1.4 查询语句中创建表并加载数据（as select）

将查询的结果保存到一张表当中去

create table score5 as select * from score;

1.5 创建表时通过location指定加载数据路径

1. 创建表，并指定在hdfs上的位置

create external table score6 (s_id string,c_id string,s_score int) row format delimited fields terminated by '\t' location '/myscore6';

2. 上传数据到hdfs上

hdfs dfs -mkdir -p /myscore6 hdfs dfs -put score.csv /myscore6;

3. 查询数据

select * from score6;

二. Hive表中的数据导出（了解就行）

将hive表中的数据导出到其他任意目录，例如linux本地磁盘，例如hdfs，例如mysql等等

2.1 insert导出

1. 将查询的结果导出到本地

insert overwrite local directory '/export/servers/exporthive' select * from score;

2. 将查询的结果格式化导出到本地

insert overwrite local directory '/export/servers/exporthive' row format delimited fields terminated by '\t' collection items terminated by '#' select * from student;

3. 将查询的结果导出到HDFS上(没有local)

insert overwrite directory '/export/servers/exporthive' row format delimited fields terminated by '\t' collection items terminated by '#' select * from score;

2.2 Hadoop命令导出到本地

dfs -get /export/servers/exporthive/000000_0 /export/servers/exporthive/local.txt;

2.3 hive shell 命令导出

基本语法：（hive -f/-e 执行语句或者脚本 > file）

bin/hive -e "select * from myhive.score;" > /export/servers/exporthive/score.txt

2.4 export导出到HDFS上(全表导出)

export table score to '/export/exporthive/score';

三. 清空表数据

只能清空管理表，也就是内部表

truncate table score6;

清空这个表会报错

本次的分享就到这里了,

看完就赞，养成习惯！！！ \color{#FF0000}{看完就赞，养成习惯！！！} 看完就赞，养成习惯！！！^ _ ^ ❤️ ❤️ ❤️

码字不易，大家的支持就是我坚持下去的动力。后不要忘了关注我哦！

Hadoop Hive

elasticsearch入门 系列">elasticsearch入门 系列

954 2022-05-29

快速跳到我想要的那一页（怎么快速到下一页）">怎么快速跳到我想要的那一页（怎么快速到下一页）

954 2022-05-29

深入浅出etcd系列】3. 日志同步">【深入浅出etcd系列】3. 日志同步

954 2022-05-29

Hive 快速 入门 系列(9) | Hive表中数据的加载与导出

elasticsearch入门 系列">elasticsearch入门 系列

快速跳到我想要的那一页（怎么快速到下一页）">怎么快速跳到我想要的那一页（怎么快速到下一页）

深入浅出etcd系列】3. 日志同步">【深入浅出etcd系列】3. 日志同步

推荐文章

企业生产管理是什么，企业生产管理软件

进盘点进销存软件排行榜前十名

进销存系统哪个简单好用？进销存系统优点

工厂生产管理（工厂生产管理流程及制度）

生产管理软件，机械制造业生产管理，制造业生产过程管理软件

进销存软件和ERP有什么区别？进销存与erp软件理解

进销存如何进行库存管理

如何利用excel制作销售订单管理系统？

数据库订单管理系统有哪些功能？数据库订单管理系统怎么设计？

什么是数据库管理系统？

最近发表

热评文章

零代码开发是什么？2022低代码平台排行榜">零代码开发是什么？2022低代码平台排行榜

进销存库存管理 系统（智慧进销存）">智能进销存库存管理系统（智慧进销存）

在线文档哪家强？8款在线文档编辑软件推荐">在线文档哪家强？8款在线文档编辑软件推荐

WPS2016怎么绘制简单的价格表?

智能定制家居管理系统：重新定义家庭生活方式

客户管理工具是什么？">客户管理工具是什么？

友情链接

Hive快速入门系列(9) | Hive表中数据的加载与导出

微信扫一扫：分享

elasticsearch入门系列">elasticsearch入门系列

快速跳到我想要的那一页（怎么快速到下一页）">怎么快速跳到我想要的那一页（怎么快速到下一页）

深入浅出etcd系列】3. 日志同步">【深入浅出etcd系列】3. 日志同步

推荐文章

最近发表

热评文章

零代码开发是什么？2022低代码平台排行榜">零代码开发是什么？2022低代码平台排行榜

进销存库存管理系统（智慧进销存）">智能进销存库存管理系统（智慧进销存）

在线文档哪家强？8款在线文档编辑软件推荐">在线文档哪家强？8款在线文档编辑软件推荐

客户管理工具是什么？">客户管理工具是什么？

友情链接

Hive 快速入门系列(9) | Hive表中数据的加载与导出