Hive's six storage formats:
AVRO, ORC, PARQUET, RCFILE, SEQUENCEFILE, TEXTFILE
(avro, orc, parquet, rcfile, sequencefile, textfile)
| Syntax | Equivalent standard syntax |
| --- | --- |
| STORED AS AVRO / STORED AS AVROFILE | ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.avro.AvroSerDe' STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat' |
| STORED AS ORC / STORED AS ORCFILE | ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.orc.OrcSerde' STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat' |
| STORED AS PARQUET / STORED AS PARQUETFILE | ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat' |
| STORED AS RCFILE | STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.RCFileInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.RCFileOutputFormat' |
| STORED AS SEQUENCEFILE | STORED AS INPUTFORMAT 'org.apache.hadoop.mapred.SequenceFileInputFormat' OUTPUTFORMAT 'org.apache.hadoop.mapred.SequenceFileOutputFormat' |
| STORED AS TEXTFILE | STORED AS INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat' |
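
As a rough illustration of how the shorthand and the standard syntax relate, the HiveQL sketch below creates the same ORC table twice, once with each form. The table names (`page_views_orc`, `page_views_orc_explicit`) and the columns are hypothetical, chosen only for this example. The class names are the ones listed for ORC above and must be quoted without leading or trailing spaces.

```sql
-- Hypothetical table and column names, for illustration only.

-- Shorthand form: Hive fills in the SerDe and the input/output formats.
CREATE TABLE page_views_orc (
  user_id BIGINT,
  url     STRING
)
STORED AS ORC;

-- Standard form: the same storage settings spelled out explicitly.
CREATE TABLE page_views_orc_explicit (
  user_id BIGINT,
  url     STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat';

-- Inspect the resolved SerDe library, InputFormat and OutputFormat.
DESCRIBE FORMATTED page_views_orc;
DESCRIBE FORMATTED page_views_orc_explicit;
```

Comparing the storage sections of the two `DESCRIBE FORMATTED` outputs is one way to check which classes a given shorthand resolves to on a particular Hive version.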