
phatak-dev/introduction-to-spark

This repository contains example code and sample data for the Introduction to Apache Spark session. Follow the steps below to set up Spark on your machine, and clone this repository to get the code and data.

1. Installing Spark

Step 1.1: Prerequisites

You should have Java installed on your machine.
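
One way to confirm the prerequisite on Linux or Mac is to ask Java for its version. The sketch below is only an illustration: the function name `check_java` and the wording of the message are assumptions, not part of the session material.

```shell
#!/bin/sh
# Report whether a working Java runtime is available on the PATH.
check_java() {
    if java -version >/dev/null 2>&1; then
        # java prints its version banner to stderr, so send it to stdout
        java -version 2>&1
    else
        echo "Java not found: install a JDK or JRE before continuing"
        return 1
    fi
}

check_java || true
```

On Windows, running `java -version` in a Command Prompt gives the same information.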

Step 1.2: Download Spark from the Apache website

Download the spark-1.2.1-bin-hadoop2.4.tgz package from the Apache Spark website.

Step 1.3: Install

Linux and Mac

tar -zxvf spark-1.2.1-bin-hadoop2.4.tgz

Windows

  • Use 7-Zip to extract the file. You have to extract twice: once to unpack the .tgz into a .tar, and again to unpack the .tar into the folder.

  • Copy the spark-1.2.1-bin-hadoop2.4 folder to the root of the C drive.

2. Running spark-shell

Change to the spark-1.2.1-bin-hadoop2.4 folder.

Start spark-shell.

Linux and Mac

	bin/spark-shell

Windows

	bin\spark-shell.cmd

If spark-shell starts successfully, Spark is installed on your machine.
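
Beyond seeing the prompt appear, a small job can confirm that the shell actually executes work. The following is a minimal sketch for Linux and Mac, assuming it is run from the Spark folder. The `smoke_test` function and the `SPARK_HOME` default are illustrative assumptions, and the Scala line simply counts the numbers 1 to 100 using the `sc` context that spark-shell provides.

```shell
#!/bin/sh
# Run a one-line Spark job through spark-shell as a smoke test.
# Defaulting SPARK_HOME to the current folder is an assumption of this sketch.
SPARK_HOME="${SPARK_HOME:-.}"

smoke_test() {
    if [ -x "$SPARK_HOME/bin/spark-shell" ]; then
        # spark-shell reads the Scala line from stdin and exits at end of input
        echo 'println(sc.parallelize(1 to 100).count())' | "$SPARK_HOME/bin/spark-shell"
    else
        echo "bin/spark-shell not found under $SPARK_HOME"
        return 1
    fi
}

smoke_test || true
```

The same Scala expression can also be typed directly at the spark-shell prompt. The count should appear among the log lines that Spark prints.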

3. Getting code

Clone this repository using the following command

    git clone https://github.com/phatak-dev/introduction-to-spark.git

to get the example code and sample data.

4. Staying up to date

Please pull the latest changes before coming to the session so that you have the latest code.
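
Updating an existing clone can be sketched as below. The `update_repo` function and the `REPO_DIR` variable are hypothetical names for this example. The default folder name is the one `git clone` creates for this repository.

```shell
#!/bin/sh
# Pull the latest changes into an existing clone.
# REPO_DIR is a hypothetical variable; adjust it to where you cloned the repository.
REPO_DIR="${REPO_DIR:-introduction-to-spark}"

update_repo() {
    if [ -d "$REPO_DIR/.git" ]; then
        # -C runs git inside the clone without changing the current folder
        git -C "$REPO_DIR" pull
    else
        echo "No clone found at $REPO_DIR; run git clone first"
        return 1
    fi
}

update_repo || true
```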

About

Code and setup information for the Introduction to Spark session at http://www.meetup.com/Bangalore-Apache-Spark-Meetup/events/220270823/
