Sqoop

Describing Sqoop

  • Sqoop is a software framework
  • It is used for:

    • Importing data into HDFS
    • Exporting processed data from HDFS to any RDBMS
  • There are sqoop commands for the following:

    • Listing all of our databases
    • Listing tables in a specified database
    • Fetching data from HDFS
    • Exporting data from an RDBMS into HDFS
    • Exporting data from an RDBMS into HBase
    • Loading data into HBase
    • Performing minor data summaries based on:

      • Rows
      • Columns
      • Tables
      • etc.

References

Previous
Next

Hive

ZooKeeper