Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 48fe6ec

Browse files
author
Dean Wampler
committed
Updated the discussion of notebook options
1 parent 2ebd689 commit 48fe6ec

File tree

1 file changed

+11
-12
lines changed

1 file changed

+11
-12
lines changed

README.markdown

Lines changed: 11 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -4,18 +4,18 @@
44

55
![](http://spark.apache.org/docs/latest/img/spark-logo-hd.png)
66

7-
Dean Wampler, Ph.D.<br/>
8-
[Lightbend](http://lightbend.com)<br/>
9-
[dean.wampler@lightbend.com](mailto:dean.wampler@lightbend.com)<br/>
7+
Dean Wampler<br/>
8+
[Anyscale](http://anyscale.com)<br/>
9+
[dean@anyscale.com](mailto:dean@anyscale.com)<br/>
1010
[@deanwampler](https://twitter.com/deanwampler)
1111

1212
This tutorial demonstrates how to write and run [Apache Spark](http://spark.apache.org) applications using Scala with some SQL. I also teach a little Scala as we go, but if you already know Spark and you are more interested in learning just enough Scala for Spark programming, see my other tutorial [Just Enough Scala for Spark](https://github.com/deanwampler/JustEnoughScalaForSpark).
1313

1414
This tutorial demonstrates how to write and run [Apache Spark](http://spark.apache.org) applications using Scala with some SQL. You can run the examples and exercises several ways:
1515

16-
1. [Jupyter notebooks](http://jupyter.org/) - The easiest way, especially for data scientists accustomed to _notebooks_
17-
2. In an IDE, like [IntelliJ](https://www.jetbrains.com/idea/) - Familiar for developers
18-
3. At the terminal prompt using the build tool [SBT](https://www.scala-sbt.org/)
16+
1. Notebooks, like [Jupyter](http://jupyter.org/) - The easiest way, especially for data scientists accustomed to _notebooks_.
17+
2. In an IDE, like [IntelliJ](https://www.jetbrains.com/idea/) - Familiar for developers.
18+
3. At the terminal prompt using the build tool [SBT](https://www.scala-sbt.org/).
1919

2020
This tutorial is mostly about learning Spark, but I teach you a little Scala as we go. If you are more interested in learning just enough Scala for Spark programming, see my new tutorial [Just Enough Scala for Spark](https://github.com/deanwampler/spark-scala-tutorial).
2121

@@ -46,12 +46,12 @@ Begin by cloning or downloading the tutorial GitHub project [github.com/deanwamp
4646

4747
Now Pick the way you want to work through the tutorial:
4848

49-
1. Jupyter notebooks - Go [here](#use-jupyter-notebooks)
49+
1. Notebooks - Go [here](#use-notebooks)
5050
2. In an IDE, like IntelliJ - Go [here](#use-ide)
5151
3. At the terminal prompt using SBT - Go [here](#use-sbt)
5252

53-
<a name="use-jupyter-notebooks"></a>
54-
## Using Jupyter Notebooks
53+
<a name="use-notebooks"></a>
54+
## Using Notebooks
5555

5656
The easiest way to work with this tutorial is to use a [Docker](https://docker.com) image that combines the popular [Jupyter](http://jupyter.org/) notebook environment with all the tools you need to run Spark, including the Scala language. It's called the [all-spark-notebook](https://hub.docker.com/r/jupyter/all-spark-notebook/). It bundles [Apache Toree](https://toree.apache.org/) to provide Spark and Scala access.
5757
The [webpage](https://hub.docker.com/r/jupyter/all-spark-notebook/) for this Docker image discusses useful information like using Python as well as Scala, user authentication topics, running your Spark jobs on clusters, rather than local mode, etc.
@@ -60,13 +60,12 @@ There are other notebook options you might investigate for your needs:
6060

6161
**Open source:**
6262

63-
* [Jupyter](https://ipython.org/) + [BeakerX](http://beakerx.com/) - a powerful set of extensions for Jupyter
63+
* [Polynote](https://polynote.org/) - A cross-language notebook environment with built-in Scala support. Developed by Netflix.
64+
* [Jupyter](https://ipython.org/) + [BeakerX](http://beakerx.com/) - a powerful set of extensions for Jupyter.
6465
* [Zeppelin](http://zeppelin-project.org/) - a popular tool in big data environments
65-
* [Spark Notebook](http://spark-notebook.io) - a powerful tool, but not as polished or well maintained
6666

6767
**Commercial:**
6868

69-
* [IBM Data Science Experience](http://datascience.ibm.com/) - IBM's full-featured environment for data science
7069
* [Databricks](https://databricks.com/) - a feature-rich, commercial, cloud-based service
7170

7271
## Installing Docker and the Jupyter Image

0 commit comments

Comments
 (0)