This event has ended. Visit the official site or create your own event on Sched.
Click here to return to main conference site. For a one page, printable overview of the schedule, see this.
Back To Schedule
Monday, June 27 • 2:30pm - 4:00pm
Introduction to SparkR (Part 2)

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Spark is a popular cluster computing framework used for performing large scale data analysis. This tutorial will introduce cluster computing using SparkR: the R language API for Spark. SparkR provides a distributed data frame API that enables structured data processing with a syntax familiar to R users. In this tutorial we will provide example workflows for ingesting data, performing data analysis and doing interactive queries using distributed data frames. Finally, participants will be able to try SparkR on realworld datasets using Databricks R notebooks to get hands-on experience using SparkR.

For details, refer to tutorial description.


Hossein Falaki

Databricks Inc.

Monday June 27, 2016 2:30pm - 4:00pm PDT
Econ 140