关于hive数据导入方式的总结

2021-11-07 15:15:13

从本地导入数据到hive：

load data local inpath '/home/hive/tb_dw_cu_three_type_list_dtal/*.dat' overwrite into table csap.tb_dw_cu_three_type_list_dtal partition(statis_date=20160121);

从hdfs直接导入数据到hive：

load data inpath '/hadoop/interface/renadata/aol_open_call_log_*_201602_20160215.dat' into table csap.tb_ods_ct_rena_open_call_log_all_day partition(statis_date=20160215);

请注意load data inpath ‘/home/wyp/add.txt’ into table

wyp;里面是没有local这个单词的，这个是和本地的区别。

从一个表中导数据到另一张表：

insert into table test partition (age='25')

select id, name, tel from wyp;

hive支持多表插入：

from wyp

insert into table test partition(age)

select id, name, tel, age

insert into table test3

select id, name where age>25;

注意：传统数据块的形式insert into table values（字段1，字段2），这种形式hive是不支持的。

分区：在hive中，表的每一个分区对应表下的相应目录，所有分区的数据都是存储在对应的目录中。

装载数据相当于把之前hdfs上的数据移动到hive对应的目录下。

关于hive数据导入方式的总结

继续阅读

Eclipse运行WordCount（详细版）相关连接Eclipse运行WordCount

Oracle的基本操作

BMP文件结构及图像每行字节计算方法

磁盘结构及在Linux中的命名

SQL优化SQL语句优化的目的

JAVA高效编程指南

hadoop 用MR实现join操作

Centos7 下 Hadoop 2.6.4 分布式集群环境搭建摘要集群准备安装JDK 安装 Hadoop 2.6.4 部署 slaver1-slaver4 启动 hadoop 集群成功了

关于SQL语言

SQL语言基础：常用的数据查询语句

MapReduce的几个企业级经典面试案例MapReduce的几个企业级经典面试案例

ubuntu14.04下安装hbse1.0.1.1

User Defined Hadoop DataType

neo4j之cypher使用文档

Ambari介绍和架构原理

sqlServer根据经纬查距离