Hive - upload gzipped CSV files to HDFS and bring them together as an EXTERNAL table
You have a gzipped CSV file that you want to upload to an HDFS folder. Hive then addresses that folder as an external table, and from there the data is inserted into a 'real' big data table.
The benefit: an external table reads every file in its folder, so you can upload your compressed files in chunks and then use them as a single data table in your big data system.
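The workflow above can be sketched roughly as follows. This is a minimal illustration, not a definitive implementation: the helper names, the table names (staging_csv, events) and the HDFS path are hypothetical, and it assumes plain comma-separated values with no embedded commas or quotes, so that Hive's ROW FORMAT DELIMITED is sufficient. The Python part only writes the gzipped chunks and builds the command and statements as text; running them against a cluster is left to you.

```python
import csv
import gzip
from pathlib import Path


def write_gzip_chunks(rows, out_dir, chunk_size, prefix="part"):
    """Write rows as gzipped CSV files holding at most chunk_size rows each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for start in range(0, len(rows), chunk_size):
        path = out_dir / f"{prefix}_{start // chunk_size:05d}.csv.gz"
        # Use "\n" line endings so Hive does not see a stray "\r" in the last column.
        with gzip.open(path, "wt", newline="", encoding="utf-8") as fh:
            csv.writer(fh, lineterminator="\n").writerows(rows[start:start + chunk_size])
        paths.append(path)
    return paths


def upload_command(local_path, hdfs_dir):
    """Argument list for copying one chunk into the HDFS folder."""
    return ["hdfs", "dfs", "-put", str(local_path), hdfs_dir]


def external_table_ddl(table, columns, hdfs_dir):
    """HiveQL that exposes every file under hdfs_dir as one external table."""
    cols = ",\n  ".join(f"`{name}` {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n  {cols}\n)\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{hdfs_dir}';"
    )


def load_statement(staging_table, target_table):
    """HiveQL that copies the staged rows into the 'real' table."""
    return f"INSERT INTO TABLE {target_table} SELECT * FROM {staging_table};"


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    hdfs_dir = "/data/staging_csv"
    print(external_table_ddl("staging_csv", [("id", "INT"), ("name", "STRING")], hdfs_dir))
    print(load_statement("staging_csv", "events"))
```

Each list returned by upload_command can be passed to subprocess.run on a machine with the Hadoop client installed, and the two statements can be run from the Hive shell or Beeline. If your CSV data contains quoted fields, a CSV-aware SerDe would be a better fit than ROW FORMAT DELIMITED.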