Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
64 commits
Select commit Hold shift + click to select a range
f4e9cdb
Converted tests to pytest. Build a Python package. Update requirement…
rjurney Feb 16, 2025
c256244
Restore Python .gitignore
rjurney Feb 16, 2025
6c3df0b
Extra newline removed
rjurney Feb 16, 2025
b2838d2
Merge branch 'master' of github.com:graphframes/graphframes into rjur…
rjurney Feb 16, 2025
caf5091
Added VERSION file set to 0.8.5
rjurney Feb 16, 2025
7cfa2d1
isort; fiex edgesDF variable name.
rjurney Feb 16, 2025
2ca9a15
Merge branch 'master' of github.com:graphframes/graphframes into rjur…
rjurney Feb 16, 2025
a8bf0be
Back out Dockerfile changes
rjurney Feb 16, 2025
54a942d
Back out version change in build.sbt
rjurney Feb 16, 2025
8b0e346
Backout changes to config and run-tests
rjurney Feb 16, 2025
46c2b93
Back out pytest conversion
rjurney Feb 16, 2025
18b5da0
Back out version changes to make nose tests pass
rjurney Feb 16, 2025
8eca097
Remove changes to requirements
rjurney Feb 16, 2025
277c06f
Put nose back in requirements.txt
rjurney Feb 16, 2025
b55ee48
Remove version bump to version.sbt
rjurney Feb 16, 2025
f8a8fd9
Remove packages related to testing
rjurney Feb 16, 2025
bc2cb36
Remove old setup.py / setup.cfg
rjurney Feb 16, 2025
728be33
New pyproject.toml and poetry.lock
rjurney Feb 16, 2025
3cea1a8
Short README for Python package, poetry won't allow a ../README.md path
rjurney Feb 16, 2025
87cc975
Remove requirements files in favor of pyproject.toml
rjurney Feb 16, 2025
6f84a5a
Try to poetrize CI build
rjurney Feb 16, 2025
9a8eef0
pyspark min 3.4
rjurney Feb 16, 2025
75ecd99
Local python README in pyproject.toml
rjurney Feb 16, 2025
80231d0
Trying to remove he working folder to debug scala issue
rjurney Feb 16, 2025
2a9170b
Set Python working directory again
rjurney Feb 16, 2025
3de2263
Accidental newline
rjurney Feb 16, 2025
4662717
Install Python for test...
rjurney Feb 17, 2025
1b7b9f8
Run tests from python/ folder
rjurney Feb 17, 2025
58da493
Try running tests from python/
rjurney Feb 17, 2025
9f4aa24
poetry run the unit tests
rjurney Feb 17, 2025
11b2782
poetry run the tests
rjurney Feb 17, 2025
9772344
Try just using 'python' instead of a path
rjurney Feb 17, 2025
d55dbfe
poetry run the last line, graphframes.main
rjurney Feb 17, 2025
2fc4d08
Remove test/ folder from style paths, it doesn't exist
rjurney Feb 17, 2025
8297a13
Remove .vscode
rjurney Feb 17, 2025
2035d98
VERSION back to 0.8.4
rjurney Feb 17, 2025
f9f4bd7
Remove tutorials reference
rjurney Feb 17, 2025
9ddd6b2
VERSION is a Python thing, it belongs in python/
rjurney Feb 17, 2025
7065647
Include the README.md and LICENSE in the Python package
rjurney Feb 17, 2025
a6c7e91
Some classifiers for pyproject.toml
rjurney Feb 17, 2025
51e3e6d
Trying poetry install action instead of manual install
rjurney Feb 17, 2025
272be06
Removing SPARK_HOME
rjurney Feb 17, 2025
4587999
Returned SPARK_HOME settings
rjurney Feb 17, 2025
2422b22
Minimized the PR to just these files
rjurney Feb 17, 2025
073dced
Merge in rjurney/build-upgrades and in turn master
rjurney Feb 17, 2025
0a1faba
Created tutorials dependency group to minimize main bloat
rjurney Feb 17, 2025
c0d6d7b
Make motif.py execute in whole again
rjurney Feb 17, 2025
5bb4c26
Minor isort format and cleanup of download.py
rjurney Feb 17, 2025
99e6a4d
Minor isort format and cleanup of utils.py
rjurney Feb 17, 2025
662e197
Removed case sensitivity from the script - that was confusing people …
rjurney Feb 17, 2025
beaa35d
motif.py now matches tutorial code, runs and handles case insensitivity.
rjurney Feb 17, 2025
1bf4a9e
Regenerate poetry.lock
rjurney Feb 21, 2025
ef19784
Setup a 'graphframes stackexchange' comand.
rjurney Feb 21, 2025
4400cb4
Make graphframes.tutorials.motif use a checkpoint dir unique, and fro…
rjurney Feb 21, 2025
d549c56
Use spark.sparkContext.setCheckpointDir directly instead of instantia…
rjurney Feb 21, 2025
b970636
Using 'from __future__ import annotations' intsead of List and Tuple
rjurney Feb 21, 2025
3788941
Now retry three times if we can't connect for any reason in 'graphfra…
rjurney Feb 21, 2025
e95bbbe
Merge master
rjurney Feb 25, 2025
413a915
Merge branch 'master' of github.com:graphframes/graphframes
rjurney Mar 8, 2025
b6e8c27
Merge branch 'master' of github.com:graphframes/graphframes
rjurney Mar 18, 2025
d4dccd3
Ignore spark-warehouse/
rjurney Jul 12, 2025
90001ee
Added urls for Pluralsight GraphFrames classes
rjurney Jul 12, 2025
789e318
Trailing newline
rjurney Jul 12, 2025
6be4a68
Merge main branch to .gitignore/README.md and resolve conflicts
rjurney Jul 14, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -54,5 +54,8 @@ python/graphframes/resources/*
# tmp data for spark connect
tmp/*

# Spark warehouse folder
spark-warehouse

# db-connect targets
graphframes-connect-databricks/*
24 changes: 14 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,9 +9,9 @@

This is a package for graphs processing and analytics on scale. It is built on top of Apache Spark and relies on DataFrame abstraction. Users can write highly expressive queries by leveraging the DataFrame API, combined with a new API for network motif finding. The user also benefits from DataFrame performance optimizations within the Spark SQL engine. GraphFrames works in Java, Scala, and Python.

You can find user guide and API docs at https://graphframes.io
You can find user guide and API docs at <https://graphframes.io>

## GraphFrames is Back!
## GraphFrames is Back

This projects was in maintenance mode for some time, but we are happy to announce that it is now back in active development! We are working on a new release with many bug fixes and improvements. We are also working on a new website and documentation.

Expand Down Expand Up @@ -135,6 +135,7 @@ g.connectedComponents().show()
## Learn GraphFrames

To learn more about GraphFrames, check out these resources:

* [GraphFrames Documentation](https://graphframes.github.io/graphframes)
* [GraphFrames Network Motif Finding Tutorial](https://graphframes.github.io/graphframes/docs/_site/motif-tutorial.html)
* [Introducing GraphFrames](https://databricks.com/blog/2016/03/03/introducing-graphframes.html)
Expand All @@ -144,6 +145,8 @@ To learn more about GraphFrames, check out these resources:

* [GraphFrames Google Group](https://groups.google.com/forum/#!forum/graphframes)
* [#graphframes Discord Channel on GraphGeeks](https://discord.com/channels/1162999022819225631/1326257052368113674)
* [Graph Operations in Apache Spark Using GraphFrames](https://www.pluralsight.com/courses/apache-spark-graphframes-graph-operations)
* [Executing Graph Algorithms with GraphFrames on Databricks](https://www.pluralsight.com/courses/executing-graph-algorithms-graphframes-databricks)

## `graphframes-py` is our Official PyPi Package

Expand All @@ -153,7 +156,7 @@ We recommend using the Spark Packages system to install the latest version of Gr
pip install graphframes-py
```

This project does not own or control the [graphframes PyPI package](https://pypi.org/project/graphframes/) (installs 0.6.0) or [graphframes-latest PyPI package](https://pypi.org/project/graphframes-latest/) (installs 0.8.4).
This project does not own or control the [graphframes PyPI package](https://pypi.org/project/graphframes/) (installs 0.6.0) or [graphframes-latest PyPI package](https://pypi.org/project/graphframes-latest/) (installs 0.8.4).

## GraphFrames and sbt

Expand Down Expand Up @@ -210,7 +213,7 @@ GraphFrames was made as collaborative effort among UC Berkeley, MIT, Databricks

See [contribution guide](./CONTRIBUTING.md)

## Releases:
## Releases

See [release notes](https://github.com/graphframes/graphframes/releases).

Expand All @@ -221,9 +224,10 @@ Please read [this section](https://central.sonatype.org/publish/publish-portal-s

GroupId: `io.graphframes`
ArtifactIds:
- `graphframes-spark3_2.12`
- `graphframes-spark3_2.13`
- `graphframes-connect-spark3_2.12`
- `graphframes-connect-spark3_2.13`
- `graphframes-spark4_2.13`
- `graphframes-connect-spark4_2.13`

* `graphframes-spark3_2.12`
* `graphframes-spark3_2.13`
* `graphframes-connect-spark3_2.12`
* `graphframes-connect-spark3_2.13`
* `graphframes-spark4_2.13`
* `graphframes-connect-spark4_2.13`