How to create a Hive table from sequence file stored in HDFS?

Question

What i need is:I have sequence stored in HDFS, I have to create the table for that sequence file. what is the serde used here?

Omkar · Answer

There are two SerDe for SequenceFile as follows:TextSerializerDeserializer: This class can read and write data in plain text file format.BinarySerializerDeserializer: This class can read and write data in binary file format.The default is the SerDe for plain text file in Tajo. The above example statement created the table using TextSerializerDeserializer.If you want to use BinarySerializerDeserializer, you can specify it by&#160;sequencefile.serde keywords:CREATE TABLE tablename (id int, name text, score float, type text)
USING sequencefile with ('sequencefile.serde'='org.apache.tajo.storage.BinarySerializerDeserializer')In Hive, the above statement can be written in Hive as follows:CREATE TABLE tablename (id int, name string, score float, type string)
ROW FORMAT SERDE
 'org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe'
STORED AS sequencefile;WriterThere are three SequenceFile Writers based on the SequenceFile.CompressionType used to compress key/value pairs:Writer :&#160;Uncompressed records.RecordCompressWriter :&#160;Record-compressed files, only compress values.BlockCompressWriter :&#160;Block-compressed files, both keys & values are collected in &#8216;blocks&#8217; separately and compressed. The size of the &#8216;block&#8217; is configurable.The default is Uncompressed Writer in Tajo. If you want to use RecordCompressWriter, you can specify it by compression.type keywords and compression.codec keywords:CREATE TABLE tablename (id int, name text, score float,type text)
USING sequencefile with ('compression.type'='RECORD','compression.codec'='org.apache.hadoop.io.compress.SnappyCodec')In&#160;hive, you need to specify settings as follows:hive> SET hive.exec.compress.output = true;
hive> SET mapred.output.compression.type = RECORD;
hive> SET mapred.output.compression.codec = org.apache.hadoop.io.compress.SnappyCodec;
hive> CREATE TABLE tablename (id int, name string, score float, type string) STORED AS sequencefile;And if you want to use BlockCompressWriter, you can specify it by compression.type keywords and compression.codec keywords:CREATE TABLE tablename (id int, name text, score float, type text)
USING sequencefile with ('compression.type'='BLOCK','compression.codec'='org.apache.hadoop.io.compress.SnappyCodec')
In&#160;hive, you need to specify settings as follows:hive> SET hive.exec.compress.output = true;
hive> SET mapred.output.compression.type = BLOCK;
hive> SET mapred.output.compression.codec = org.apache.hadoop.io.compress.SnappyCodec;
hive> CREATE TABLE tablename (id int, name string, score float, type string) STORED AS sequencefile;;For reference, you can use&#160;TextSerDe&#160;or BinarySerDe with compression keywords. Here is an example statement for this case.CREATE TABLE tablename (id int, name text, score float, type text)
USING sequencefile with ('sequencefile.serde'='org.apache.tajo.storage.BinarySerializerDeserializer', 'compression.type'='BLOCK','compression.codec'='org.apache.hadoop.io.compress.SnappyCodec')In&#160;hive, you need to specify settings as follows:hive> SET hive.exec.compress.output = true;
hive> SET mapred.output.compression.type = BLOCK;
hive> SET mapred.output.compression.codec = org.apache.hadoop.io.compress.SnappyCodec;
hive> CREATE TABLE tablename (id int, name string, score float, type string)
      ROW FORMAT SERDE
        'org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe'
      STORED AS sequencefile;

How to create a Hive table from sequence file stored in HDFS

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Big Data Hadoop

How to create smaller table from big table in HIVE?

Not able to create Hive table from HDFS file

How to unzip a zipped file stored in Hadoop hdfs?

How to create a managed table in Hive?

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

Hadoop dfs -ls command?

How to create a Hive table with a sequence file?

How to create a parquet table in hive and store data in it from a hive table?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES