MobyWare/ml-spark-intro-version-2.0

Experimenting with Apache Spark 2.0

Goals

I am most interested in the new pipelines for MLlib and the SQL interface.

There are a few notebooks with my worked examples of querying data loaded from a CSV file using the DataFrame API. I also train a few models using the pipeline framework, which chains feature preparation and model fitting into a single object. It's pretty slick.

Set up

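You can run the code in a Docker image I got from here. I mount the contents of the work folder as a volume under the /home/jovyan directory, which makes your notebooks and data available inside the container. You can use the command below, replacing the host path with the absolute path to your own directory: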

docker run -p 8888:8888 -v path/to/your/host/directory:/home/jovyan/work ezamir/jupyter-spark-2.0
