Download Getting started with Toad™ for Apache™ Hadoop® Guide in PDF
This guide focuses on how to get started with Apache™ Hadoop® easily and comfortably using Toad™ for Apache™ Hadoop® and Cloudera® QuickStart VM. The QuickStart VM contains an entire Hadoop ecosystem, which makes it a great place to start exploring the world of Hadoop.
By following this guide you will learn how to:
Before you can connect to your ecosystem using Toad for Apache Hadoop, you need to configure a few things on your local machine.
To configure your local machine, do the following:
Your VirtualBox version must be 5.0 or higher in order to work with Toad for Apache Hadoop.
Your local machine is now ready. To connect to your Cloudera QuickStart VM Hadoop ecosystem:
If your Oracle VM VirtualBox contains multiple virtual machines, choose QuickStart VM from the drop-down menu to continue.
In the real world, you will most likely want to transfer data stored in relational databases to Hive, where it can be used more effectively. Toad for Apache Hadoop makes this simple: you only need to create a connection to your database and execute a transfer.
To create a connection to your relational database:
With a connection to your relational database ready, it is time to transfer your data to Hive.
With your data stored in Hive, you can now easily view it:
Now that your data is stored in Hive, you will most likely want to work with it using the Hive Query Language:
Please note that in the world of Hadoop, executing queries takes significantly more time than in relational databases. The amount of time varies greatly depending on the complexity of your query, the amount of relevant data in Hive, the physical location of the data, and other factors.
If your query fails, the error is shown in the Diagnostics tab.
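For readers who also want to run a query from a terminal on the VM, the following is a minimal sketch that builds a `beeline` command line for a single HiveQL statement. It is not how Toad for Apache Hadoop executes queries. The host name `quickstart.cloudera`, port `10000`, user `cloudera`, and the table name `customers` are assumptions made for illustration; adjust them to your environment.

```python
import shlex


def beeline_command(query, host="quickstart.cloudera", port=10000,
                    database="default", user="cloudera"):
    """Build the argument list for running one HiveQL statement via beeline.

    The default host, port, and user are assumptions about a QuickStart VM
    setup, not values taken from this guide.
    """
    url = f"jdbc:hive2://{host}:{port}/{database}"
    # -u: JDBC URL, -n: user name, -e: statement to execute
    return ["beeline", "-u", url, "-n", user, "-e", query]


# "customers" is a hypothetical table name used only for illustration.
argv = beeline_command("SELECT COUNT(*) FROM customers")

# Print a shell-safe version of the command for copy and paste.
print(shlex.join(argv))
```

On a machine where `beeline` is installed, the list can be passed to `subprocess.run(argv, check=True)`. Expect the same long execution times described above.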
Hadoop stores all files on HDFS, which in many ways acts like a local filesystem. You can use Toad for Apache Hadoop to manage files and folders stored in HDFS. This chapter describes basic operations such as:
Note: All files are opened in a basic plain-text viewer (notepad-like). Images and more complicated file formats are not currently supported, and opening them will result in an incomprehensible preview (similar to opening such files in Notepad).
You can download files and folders from HDFS to your local machine, much as you would from network disk storage.
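If you are unsure whether a file will be readable in a plain-text viewer, a rough check on the first bytes of a local copy can help. The sketch below is a common heuristic (no NUL bytes and valid UTF-8), not something Toad for Apache Hadoop does itself; the function name `looks_like_text` is hypothetical.

```python
import codecs


def looks_like_text(sample: bytes) -> bool:
    """Heuristic: treat a byte sample as text if it has no NUL bytes
    and decodes as UTF-8. A multi-byte character cut off at the end of
    the sample is tolerated."""
    if b"\x00" in sample:
        return False
    decoder = codecs.getincrementaldecoder("utf-8")()
    try:
        # final=False allows an incomplete trailing byte sequence.
        decoder.decode(sample, final=False)
    except UnicodeDecodeError:
        return False
    return True


csv_sample = b"id,name\n1,Alice\n"
png_sample = b"\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR"

print(looks_like_text(csv_sample))
print(looks_like_text(png_sample))
```

A file that fails this check is likely to appear as unreadable characters in the viewer.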
You can also upload files from your local machine to HDFS.
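The same listing, download, and upload operations are also available from the standard `hdfs dfs` command-line client. The sketch below only builds the command lines, so it can be read without a cluster; the paths are hypothetical examples, and this is a command-line alternative rather than a description of what Toad for Apache Hadoop runs internally.

```python
def hdfs_list(remote_dir):
    """Command line that lists the contents of an HDFS directory."""
    return ["hdfs", "dfs", "-ls", remote_dir]


def hdfs_download(remote_path, local_path):
    """Command line that copies a file or folder from HDFS to the local disk."""
    return ["hdfs", "dfs", "-get", remote_path, local_path]


def hdfs_upload(local_path, remote_path):
    """Command line that copies a local file or folder into HDFS."""
    return ["hdfs", "dfs", "-put", local_path, remote_path]


# Hypothetical paths, used only for illustration.
commands = [
    hdfs_list("/user/cloudera"),
    hdfs_download("/user/cloudera/report.csv", "./report.csv"),
    hdfs_upload("./new_data.csv", "/user/cloudera/new_data.csv"),
]

for argv in commands:
    print(" ".join(argv))
```

On a machine with the Hadoop client installed, each list can be passed to `subprocess.run(argv, check=True)` to execute it.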