feat: Allow iloc to support lists of negative indices #1497

Genesis929 · 2025-03-17T22:51:02Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

GarrettWu · 2025-03-17T22:58:10Z

tests/system/small/test_dataframe.py

@@ -4409,7 +4409,7 @@ def test_loc_list_multiindex(scalars_dfs_maybe_ordered):


 def test_iloc_list(scalars_df_index, scalars_pandas_df_index):
-    index_list = [0, 0, 0, 5, 4, 7]
+    index_list = [0, 0, 0, 5, 4, 7, -2, -5, 3]


Do we support base case of iloc[neg_number]?

Seems works.

TrevorBergeron · 2025-03-19T00:59:06Z

bigframes/core/indexers.py

+        if not is_key_unisigned or key[0] < 0:
+            neg_block, _ = block.apply_window_op(
+                offsets_id,
+                ops.aggregations.ReverseRowNumberOp(),


Do we need a new op? Or could we have just used existing ops?

Updated, using SizeUnaryOp and SubOp instead.

TrevorBergeron · 2025-03-19T01:05:19Z

bigframes/core/indexers.py

+        elif "shape" in series_or_dataframe._block.__dict__:
+            # If there is a cache, we convert all indices to positive.
+            row_count = series_or_dataframe._block.shape[0]
+            key = [k if k >= 0 else row_count + k for k in key]
+            is_key_unisigned = True


This seems a bit fragile. We can use block.expr.node.row_count, but going though shape depends on some implementation details that might change. I don't know if we necessarily need this optimization at all?

TrevorBergeron · 2025-03-19T20:28:30Z

bigframes/core/indexers.py

@@ -477,6 +478,19 @@ def _iloc_getitem_series_or_dataframe(
                Union[bigframes.dataframe.DataFrame, bigframes.series.Series],
                series_or_dataframe.iloc[0:0],
            )
+
+        # Check if both positive index and negative index are necessary
+        if isinstance(key, bigframes.series.Series):


might need to check for bigframes.Index as well? Or maybe we should have a helper that helps identify an "remote" or "large" object we don't want to iterate over

Updated to also check index type.

For large object it's may not be necessary, in cloudtop, I tried 1 million keys(which have the same sign), and this process took 0.03s.

* feat: support iloc with negative indices * update partial ordering test * update naming * update logic * update comment * update logic and tests * update filter

Genesis929 added 2 commits March 17, 2025 22:49

feat: support iloc with negative indices

d6bc14f

update partial ordering test

d57ed9b

product-auto-label bot added size: s Pull request size is small. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Mar 17, 2025

Genesis929 marked this pull request as ready for review March 17, 2025 22:51

Genesis929 requested review from a team as code owners March 17, 2025 22:51

Genesis929 requested a review from shuoweil March 17, 2025 22:51

blunderbuss-gcf bot assigned jiaxunwu Mar 17, 2025

Genesis929 requested a review from GarrettWu March 17, 2025 22:51

update naming

d839ce8

Genesis929 changed the title ~~Iloc neg huanc~~ feat: support iloc with negative indices Mar 17, 2025

Merge branch 'main' into iloc_neg_huanc

26f565f

Genesis929 changed the title ~~feat: support iloc with negative indices~~ feat: iloc now supports negative indices Mar 17, 2025

GarrettWu approved these changes Mar 17, 2025

View reviewed changes

Genesis929 changed the title ~~feat: iloc now supports negative indices~~ feat: iloc now supports list of negative indices Mar 17, 2025

Genesis929 changed the title ~~feat: iloc now supports list of negative indices~~ feat: Allow iloc to support lists of negative indices Mar 17, 2025

Genesis929 requested a review from TrevorBergeron March 17, 2025 23:30

update logic

ea61d07

product-auto-label bot added size: m Pull request size is medium. and removed size: s Pull request size is small. labels Mar 18, 2025

update comment

fade680

TrevorBergeron reviewed Mar 19, 2025

View reviewed changes

Genesis929 and others added 2 commits March 19, 2025 00:41

Merge branch 'main' into iloc_neg_huanc

fc9ca42

update logic and tests

7f58504

shuoweil approved these changes Mar 19, 2025

View reviewed changes

Merge branch 'main' into iloc_neg_huanc

c74c586

Genesis929 requested a review from TrevorBergeron March 19, 2025 19:38

TrevorBergeron approved these changes Mar 19, 2025

View reviewed changes

update filter

130fde3

Merge branch 'main' into iloc_neg_huanc

1f0318a

Genesis929 enabled auto-merge (squash) March 19, 2025 20:48

Genesis929 merged commit a9cf215 into main Mar 19, 2025
18 of 24 checks passed

Genesis929 deleted the iloc_neg_huanc branch March 19, 2025 21:20

release-please bot mentioned this pull request Mar 19, 2025

chore(main): release 1.42.0 #1508

Merged

release-please bot mentioned this pull request Mar 28, 2025

chore(v1): release 1.42.0 #1567

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Allow iloc to support lists of negative indices #1497

feat: Allow iloc to support lists of negative indices #1497

Genesis929 commented Mar 17, 2025

GarrettWu Mar 17, 2025

Genesis929 Mar 17, 2025

TrevorBergeron Mar 19, 2025

Genesis929 Mar 19, 2025

TrevorBergeron Mar 19, 2025

Genesis929 Mar 19, 2025

TrevorBergeron Mar 19, 2025

Genesis929 Mar 19, 2025

feat: Allow iloc to support lists of negative indices #1497

feat: Allow iloc to support lists of negative indices #1497

Conversation

Genesis929 commented Mar 17, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment