Conversation
…s.txt and split out requirements-dev.txt. Version bumps.
…ney/build-upgrades
Let's move it to python/tests/test.py?
We should really split it up into parts, but I want to wait and just keep it tests.py for now. I'll do another PR soon to break it up into like tests/connected_components.py and stuff.
python/graphframes/tests.py
Outdated
    "spark.submit.pyFiles",
    os.path.abspath("python/dist/graphframes-0.8.4-py3-none-any.whl"),
)
cls.sc = SparkContext(master="local[4]", appName="GraphFramesTests", conf=cls.conf)
Can we avoid any usage of the SparkContext at all and rely only on a SparkSession? That allows us to smoothly test both Spark Classic and Spark Connect. You can see how it is done in #506
@SemyonSinchenko do you mean this? Can I do the equivalent in Python?
case MethodCase.CONNECTED_COMPONENTS => {
val cc = apiMessage.getConnectedComponents
graphFrame.connectedComponents
.setAlgorithm(cc.getAlgorithm)
.setCheckpointInterval(cc.getCheckpointInterval)
.setBroadcastThreshold(cc.getBroadcastThreshold)
.run()
}
@SemyonSinchenko it looks like this code was removed, so no longer an issue?
@SemyonSinchenko sorry, what about this? I was confused - I see checkpointing happening elsewhere.
cls.spark._jsc.setCheckpointDir(cls.checkpointDir)
Can we avoid any usage of the SparkContext at all and rely only on a SparkSession? That allows us to smoothly test both Spark Classic and Spark Connect. You can see how it is done in #506
I don't think so, no. At least I do not know how to accomplish that. My attempt went badly :)
def spark_session():
    # Create a SparkSession with a smaller number of shuffle partitions.
    spark = (
        SparkSession(GraphFrameTestUtils.sc)
We can just explicitly pass all the same confs (app name, etc.) to the SparkSession.builder
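A minimal sketch of what passing the confs to the builder might look like (assuming the same master, app name, and wheel path as in the diff above; this is an illustration, not the PR's actual code, and the Spark Connect behavior is untested):

```python
# Sketch: build the session directly with SparkSession.builder instead of
# SparkConf + SparkContext. Values mirror the SparkConf setup in the diff.
import os

from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .master("local[4]")
    .appName("GraphFramesTests")
    .config(
        "spark.submit.pyFiles",
        os.path.abspath("python/dist/graphframes-0.8.4-py3-none-any.whl"),
    )
    .getOrCreate()
)
```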
Sorry, what do you want me to do here? :) I don't understand.
@SemyonSinchenko I want to skip this comment... it is hard to keep things working once I monkey with how this code works. I spent an hour and I'd rather leave this alone, ship pytest and then Spark Connect :)
python/graphframes/tests.py
Outdated
cls.conf = SparkConf().setAppName("GraphFramesTests")
cls.conf.set(
    "spark.submit.pyFiles",
    os.path.abspath("python/dist/graphframes-0.8.4-py3-none-any.whl"),
)
Why do we need it? If the project is installed via poetry install to the same venv, like pyspark, it will be here anyway.
ranks = (
    graph.pregel.setMaxIter(5)
    .withVertexColumn(
        "rank",
        F.lit(1.0 / numVertices),
        F.coalesce(Pregel.msg(), F.lit(0.0)) * F.lit(1.0 - alpha)
        + F.lit(alpha / numVertices),
    )
    .sendMsgToDst(Pregel.src("rank") / Pregel.src("outDegree"))
    .aggMsgs(F.sum(Pregel.msg()))
    .run()
)
Suggested change:

pregel = graph.pregel
ranks = (
    pregel.setMaxIter(5)
    .withVertexColumn(
        "rank",
        F.lit(1.0 / numVertices),
        F.coalesce(pregel.msg(), F.lit(0.0)) * F.lit(1.0 - alpha)
        + F.lit(alpha / numVertices),
    )
    .sendMsgToDst(pregel.src("rank") / pregel.src("outDegree"))
    .aggMsgs(F.sum(pregel.msg()))
    .run()
)
Because with support of Spark Connect there will be two possible types (Pregel classic and Pregel connect).
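For reference, the update rule the Pregel code above encodes — each vertex sends rank/outDegree along its out-edges, sums incoming messages, and sets its new rank to coalesce(msg, 0) * (1 - alpha) + alpha / numVertices — can be sketched in plain Python (illustrative names, no Spark dependency; assumes every vertex has at least one out-edge, as the division by outDegree does):

```python
def pagerank(edges, num_vertices, alpha=0.15, max_iter=5):
    """Damped PageRank, mirroring the Pregel update rule above.

    edges: list of (src, dst) pairs over vertices 0..num_vertices-1.
    """
    # outDegree per source vertex
    out_degree = {}
    for src, _dst in edges:
        out_degree[src] = out_degree.get(src, 0) + 1

    # Initial rank: 1 / numVertices for every vertex
    ranks = {v: 1.0 / num_vertices for v in range(num_vertices)}

    for _ in range(max_iter):
        # sendMsgToDst + aggMsgs: sum of rank/outDegree over incoming edges
        msgs = {v: 0.0 for v in range(num_vertices)}
        for src, dst in edges:
            msgs[dst] += ranks[src] / out_degree[src]
        # withVertexColumn: coalesce(msg, 0) * (1 - alpha) + alpha / n
        ranks = {
            v: msgs[v] * (1.0 - alpha) + alpha / num_vertices
            for v in range(num_vertices)
        }
    return ranks
```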
@SemyonSinchenko I'm starting my review of your PR now, it will take a few days... but do you mind if we get this one in first?
@SemyonSinchenko trying to finish this as quickly as possible, almost got it done... waiting to see if tests pass :)
Whoah, it builds! @SemyonSinchenko will definitely get your comments addressed this morning so we can merge and I can review your updated PR for Connect!
SemyonSinchenko
left a comment
LGTM overall! I think we should merge it because in #506 I need to change testing fixtures again and there is no clear way to avoid it. So, let's merge it?
@SemyonSinchenko thanks for the flexibility - merging!
What changes were proposed in this pull request?
I converted the unittest/nose tests to pytest.

Why are the changes needed?
This is important because nose does not support Python 3.10, which means we can't support it either and still run unit tests.
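For context, the kind of session-scoped fixture a pytest conversion enables might look like the sketch below (a hypothetical conftest.py; the fixture name, confs, and shuffle-partition setting are illustrative assumptions, not the PR's actual code, and running it requires a Spark installation):

```python
# Hypothetical conftest.py sketch for a pytest-based Spark test suite.
import pytest
from pyspark.sql import SparkSession

@pytest.fixture(scope="session")
def spark_session():
    # Fewer shuffle partitions keep local test runs fast.
    spark = (
        SparkSession.builder
        .master("local[4]")
        .appName("GraphFramesTests")
        .config("spark.sql.shuffle.partitions", "4")
        .getOrCreate()
    )
    yield spark
    spark.stop()
```

Tests then take `spark_session` as a parameter instead of inheriting from a unittest base class, which is one reason the conversion simplifies supporting both Spark Classic and Spark Connect later.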