Introduction to Hadoop for Data Analysis

June 12, 2015 - 1:00 pm to 4:00 pm CT
J.J. Pickle Research Campus
ROC 1.900
10100 Burnet Rd
Austin, TX 78758

This introductory course is aimed at users interested in exploring Hadoop as a viable option for their current data analysis workflow. The course will introduce the basic concepts of the MapReduce programming model and Hadoop clusters. Information and instructions on using the Hadoop environment with TACC resources will also be given.

The course will consist of three sessions. The first session will introduce the basic concepts of the MapReduce programming model. The second session will introduce the basics of HDFS and Hadoop clusters, including common commands and how to get started with Hadoop clusters at TACC. The third session will introduce the Hadoop ecosystem and its common components for data analysis, such as Hadoop Streaming and Mahout.

Examples and demonstrations will be given in class. Students will also have an opportunity to practice with sample code. In order to participate, students should have a basic knowledge of computer programming and Linux.

This course will be presented to students in person and via webcast.

Registration is closed.

Click here to access the link for chat support.


Introduction to Hadoop for Data Analysis PDF

Connecting to Wrangler PDF



Jason Allison
Advanced Scientific Computing
Senior Program Coordinator