

DOC: fix ctypes example #26989

Merged 1 commit into numpy:main from fix-ctypes-example on Jul 19, 2024

Conversation

@ngoldbaum (Member) commented Jul 19, 2024

I'm not sure if the problem with c_int32 is because of an AI hallucination or because of cross-platform issues, so I switched it to int8, which should be equivalent to c_byte on all platforms.

The ctypes.Structure that was there before was a hallucination I think?
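
A quick way to sanity-check the mapping (a minimal sketch, assuming a typical platform where a C int is 32 bits; the exact classes can differ elsewhere, which is the cross-platform concern):

import numpy as np

# int8 resolves to c_byte on every platform, which is why the example was switched;
# 'i4' resolves to whichever ctypes integer type happens to be 32 bits locally.
print(np.ctypeslib.as_ctypes_type(np.dtype('int8')))  # <class 'ctypes.c_byte'>
print(np.ctypeslib.as_ctypes_type(np.dtype('i4')))    # typically <class 'ctypes.c_int'>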

@charris (Member) commented Jul 19, 2024

because of cross-platform issues

Might be part of it. Looks like [skip actions] [skip azp] [skip cirrus] skips the benchmark tests; perhaps we need to move that part of the doctesting?

@ngoldbaum force-pushed the fix-ctypes-example branch from 087e878 to de8528b on July 19, 2024 20:15
@charris (Member) commented Jul 19, 2024

The circleci testing uses python tools/refguide_check.py -v; the failing tests look to use spin check-docs -v (scipy_doctest, looks like). I suspect the tests use different environments.

@ngoldbaum (Member, Author) commented:

I don't have any context for why the doctests are in the benchmark test runner rather than running on circleci. @ev-br do you have context on this? Or why the doc builder is still using refguide-check.py? Maybe it should be migrated to use the new doctest runner? Or did we intentionally leave both running on CI?

@charris (Member) commented Jul 19, 2024

My understanding is that the students mostly worked on Windows.

@charris merged commit 495fe43 into numpy:main on Jul 19, 2024
67 of 68 checks passed
@charris (Member) commented Jul 19, 2024

Thanks Nathan.

@bmwoodruff (Member) commented:

@charris, the students for now are using a Linux remote machine on Nebari for anything that involves building NumPy and/or running tests. I personally checked each example using the following:

spin build && python -m pip install . && spin docs && python tools/refguide_check.py --doctests && spin lint

I know there has been a change to the doctesting process recently, which could be the reason for the failure.

I know for a fact that none of the output in our AI examples is hallucinated. The output never comes from the AI, as I learned quite quickly that AI-generated output is completely unreliable. We first strip all generated output, then run the AI-generated (quite reliable) code against the current dev build and insert the real output where needed.

One side effect of the AI-generation process I've followed is that occasionally we encounter instances where the output from valid, working code does not match what the doctester expects. The issue is reproducible in various environments and doesn't seem to be platform-dependent. This happened with one of the ma functions; see possee-org/genai-numpy#109 for an example. I plan to submit an official issue to NumPy on this topic next week, if the interns don't do so before their internship ends.
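
Roughly, the strip-and-rerun step described above looks something like this (a simplified sketch, not the actual genai-numpy tooling; the helper name regenerate_outputs is made up for illustration):

import contextlib
import doctest
import io

import numpy as np

def regenerate_outputs(text):
    """Discard whatever output the AI wrote and splice in the real output
    produced by running the >>> code against the installed dev build."""
    namespace = {"np": np}
    pieces = []
    for example in doctest.DocTestParser().get_examples(text):
        source = example.source.rstrip("\n")
        pieces.append(">>> " + source.replace("\n", "\n... "))
        buf = io.StringIO()
        with contextlib.redirect_stdout(buf):
            try:
                # Expressions are evaluated so their repr is captured, as in a REPL.
                result = eval(compile(source, "<example>", "eval"), namespace)
                if result is not None:
                    print(repr(result))
            except SyntaxError:
                # Statements (assignments, etc.) are simply executed.
                exec(compile(source, "<example>", "exec"), namespace)
        out = buf.getvalue().rstrip("\n")
        if out:
            pieces.append(out)
    return "\n".join(pieces)

Running something along these lines over the AI-generated examples is what produces the "cleaned up" output discussed later in this thread.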

@ngoldbaum (Member, Author) commented:

I don't think spin lint ever ran the doctests. The command for that is now spin check-docs; please integrate that into your workflow. I also think maybe the doc builder should run that command too?

Sorry for assuming what I was seeing must have been hallucinated.

@bmwoodruff (Member) commented:

Will do. The spin check-docs option became available after the original PRs were submitted. I'm checking the tests locally right now, and I'll have @otieno-juma check them on his end as well after doing a rebase.

The AI-generated "output" is quite hilarious sometimes. Its ability to count to 10 is dismal. But it does a great job of generating working code (just don't trust the output).

@charris (Member) commented Jul 19, 2024

@bmwoodruff spin check-docs has some dependencies, looks like:

pip install scipy-doctest hypothesis matplotlib scipy pytz pandas

You probably don't need all of those if you make sure to just test the relevant module.

@bmwoodruff (Member) commented:

I'm getting the same errors you guys are seeing using spin check-docs. After too much time spent tracking down why this happened, I don't think it's an environment issue, nor an issue with the AI code processing we are doing. I think the error crept in when a rebase on the example was done.

When I run our AI parser code (as I just did a bit ago) against the current dev build of NumPy, I get the following:

AI-generated code log:

text = """
        Converting a simple dtype:
        
        >>> dt = np.dtype('i4')
        >>> ctype = np.ctypeslib.as_ctypes_type(dt)
        >>> ctype
        <class 'ctypes.c_int32'>

        Converting a structured dtype:
        
        >>> dt = np.dtype([('x', 'i4'), ('y', 'f4')])
        >>> ctype = np.ctypeslib.as_ctypes_type(dt)
        >>> ctype
        <class 'ctypes.Structure'>
"""
print(clean_and_process_text(text))

Here's the cleaned-up output that should have been in the PR.

        Converting a simple dtype:

        >>> dt = np.dtype('i4')
        >>> ctype = np.ctypeslib.as_ctypes_type(dt)
        >>> ctype
        <class 'ctypes.c_int'>

        Converting a structured dtype:

        >>> dt = np.dtype([('x', 'i4'), ('y', 'f4')])
        >>> ctype = np.ctypeslib.as_ctypes_type(dt)
        >>> ctype
        <class 'struct'>

The cleaned code output matches the images on the review I did of this PR prior to submission, as well as the initial submission of #26827. However, the actual code in the merged version of the PR matches the garbage AI output (a force-pushed rebase on main hides the history here). @otieno-juma, did something happen when you rebased on main?

My best guess is that when @otieno-juma did a rebase, he may have reverted to something that contained the original code. I did not confirm this rebase before @charris saw it (not sure how to implement that into our workflow once the code hits numpy/numpy as a PR). As the images match the cleaned output but the merged code matches the AI output, this seems like human error. I'm sorry that I missed that.

@otieno-juma, when you rebase the other examples, please remember to run all tests (use spin check-docs instead of the refguide_check.py command). Then ping me once you push your changes. I'll review them before @charris looks at them.

@bmwoodruff (Member) commented:

Sorry for assuming what I was seeing must have been hallucinated

@ngoldbaum, sorry that you needed to assume this. You were correct here, as somehow in the final steps of the review process the wrong code got put back into the PR. We tried to prevent hallucinations from hitting the codebase....

@ev-br (Contributor) commented Jul 20, 2024

I don't have any context for why the doctests are in the benchmark test runner rather than running on circleci. @ev-br do you have context on this? Or why the doc builder is still using refguide-check.py? Maybe it should be migrated to use the new doctest runner? Or did we intentionally leave both running on CI?

@ngoldbaum two things:

  1. refguide-check.py does not run any doctests anymore; it's literally only checking consistency of the reference guide --- that the routine lists in __init__.py docstrings agree with __all__ lists (a toy illustration of this check follows below).
  2. spin check-docs could just as well run on CircleCI indeed. The only reason it's somewhere else is to avoid rebuilding numpy: CircleCI does not use spin build.
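
A toy version of the point-1 check (the real tools/refguide_check.py is considerably more thorough; numpy.fft is just an arbitrary example module):

import numpy.fft as mod

# Toy consistency check: every public name in __all__ should be mentioned
# somewhere in the module's __init__.py docstring routine listing.
doc = mod.__doc__ or ""
missing = [name for name in mod.__all__ if name not in doc]
print("in __all__ but not mentioned in the docstring:", missing or "none")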
