Executing and capturing rich output using notebook / nbconvert #421

maartenbreddels · 2018-10-12T09:10:07Z

@lesteve : @SylvainCorlay suggested I ping you to include you in the conversation

Dear all,

Let me start with saying this is a really awesome project, and I've already managed to get it working with ipyolume (#420) quite easily.
Together with @SylvainCorlay we have been thinking of how we can have general ipywidgets support, and even xwidget (using C++ kernels).
In parallel to this, there is this PR jupyter/jupyter-sphinx#22 that uses the notebook/nbconvert system to execute code using an arbitrary kernel. If we also take into account the discussion about including HTML output from for instance pandas dataframes, #396 you can probably see an obvious direction to take.

Using the notebook/nbconvert to execute code blocks

Leveraging nbconvert to execute code seems like a very good way forward, since you get many things for free:

Rich output, html, widgets, images (matplotlib included)
Auto display of the last expression
Different languages/kernels support

Note that we do not require the examples to be in a notebook, just that the code blocks as extracted by sphinx-gallary from the .py files to be injected into a (temporary in memory?) notebook, which can be executed by nbconvert as done in jupyter/jupyter-sphinx#22.

Screenshots/thumbnails using headless chrome

Since one of the really nice things about the sphinx-gallary is the thumbnail preview, we probably also want to have screenshots generated for rich output (such as widgets). This should be possible to do with headless chrome, as shown in #420. The method should be made more general, and also requires a headless browser (chrome?) to run at readthedocs. This

Summary

The ideal future we see would be:

less code in sphinx-gallary about executing code which should help maintainability (this should be something nbconvert is good at).
maybe improvements to nbconvert to make this happen.
More language support (Python, C++, ...)
Rich output: html, jupyter-widgets, ...
A way to generate thumbnails for rich output using headless chrome (not sure where this part wil go)
Full backward compatibily, meaning matplotlib and mayavi will continue to work.

I/we would like to know if people are on board with this idea, and maybe get input from dev main developers who may have already thought of these ideas. Looking forward to your ideas/comments/blessings/rants.

Regards,

Maarten Breddels

The text was updated successfully, but these errors were encountered:

GaelVaroquaux · 2018-10-12T09:28:44Z

Thanks for the positive feedback. One challenge is that I really don't like adding dependencies. More dependencies make setting this up in a CI more difficult. For instance it creates unsolvable constraints in the set of conda requirements (we have a lot of this in scipy-lectures, where each time we change our requirements we need to fiddle with it for a couple of hours before we get in a stable situation). nbconvert comes with a pretty deep set of dependencies. It scares me a lot. I fear that the consequence is that sphinx-gallery is going to become impossible to use for projects that already have a complex set of dependencies.

maartenbreddels · 2018-10-12T09:44:41Z

What are the dependencies that you worry about specifically? Is this something that maybe can be fixed in nbconvert, by having optional dependencies?

What I like about the suggested approach is that nbconvert solves a problem that we need, and I would rather improve the whole jupyter ecosystem (that is nbconvert) than trying to solve the same issue in multiple packages.

I feel your pain, and I'm not saying nbconvert is the only way to go, but if we can make the pain go away it will be a win for all. Do you see a way that would make everybody happy? Maybe a concrete series of steps that need to be taken before you are willing to take nbconvert as a dependency?

GaelVaroquaux · 2018-10-12T09:56:19Z

What are the dependencies that you worry about specifically?

Overall the very long list of additional ones.

Is this something that maybe can be fixed in nbconvert, by having optional dependencies?

That would absolutely help.

What I like about the suggested approach is that nbconvert solves a problem that we need, and I would rather improve the whole jupyter ecosystem (that is nbconvert) than trying to solve the same issue in multiple packages.

Yes, but the jupyter ecosystem is a fairly heavy tool :).

Maybe a concrete series of steps that need to be taken before you are willing to take nbconvert as a dependency?

The most important thing is loosening up the dependencies, both with less dependencies, and with a flexible set of required version. I just tried installing nbconvert in one of our working environments, and was really amazed by what it brought it. If you're curious, try it in the following environment: https://github.com/scipy-lectures/scipy-lecture-notes/blob/master/environment.yml I am right in parallel working to fix a broken CI on a big project, and I really frown on making this more complex. The situation is that I am the only one in the group that has enough expertise to do it.

choldgraf · 2018-10-12T16:48:14Z

These are the packages that nbconvert needed to install for me (in a fresh conda environment with only pip installed, just as an FYI.

mistune, six, webencodings, bleach, MarkupSafe, jinja2, testpath, pandocfilters, entrypoints, pygments, defusedxml, ipython-genutils, decorator, traitlets, jupyter-core, jsonschema, nbformat, nbconvert

No idea which of these is more/less beefy than others, just an FYI :-)

larsoner · 2018-12-14T16:07:56Z

One path forward is to nest any nbconvert imports in functions that need it. Then projects that don't need to use it (i.e., all existing projects) don't need to change their deps, but new projects that want it can sort it out.

It does increase our overhead a bit. However, I have a keen interest in getting this to work for MNE, so I can volunteer to take care of the sphinx-gallery CIs.

@GaelVaroquaux does that sound like a viable path forward?

@maartenbreddels still interested in working toward this?

maartenbreddels · 2018-12-17T15:38:06Z

I am working on this but trying a different, less opinionated approach.
The idea is:

Several thumnail_target_* directives that are free to do what they want (e.g. run some code in the python process, run a piece of c++ code in a Jupyter kernel) but should put a thumbnail and metadata in env.thumbnails.
Have a thumbnail directive that creates a thumbnail with the exact same style as sphinx-gallery (I have a local package setup that just uses sphinx-gallery as a dependency) and link to the targets (identified by an id/ref).

This seems to me as the path of least resistance. I'm still in doubt if I should do a PR to this repo, or if is should be a new package. I'm trying to reuse as much as possible from sphinx_gallery.

larsoner · 2018-12-17T15:55:41Z

I'm still in doubt if I should do a PR to this repo, or if is should be a new package.

This new proposal looks like a generalization of what we already have: our existing / default thumbnail_target_* would be thumbnail_target_python in this refactored code. Is that correct?

If so, then this sounds like a nice, reasonable generalization of #313. Then perhaps the nbconvert-based "thumbnailing"/running code could live as hopefully simple extensions in https://github.com/sphinx-gallery/sphinx-gallery-extra. WDYT?

larsoner · 2018-12-17T15:56:18Z

(This would allow SG to stay relatively lean and backward compatible, and s-g-e could have the heavier CI dependencies needed to test and build the docs for the nbconvert-based extensions.)

maartenbreddels · 2018-12-17T16:08:16Z

This new proposal looks like a generalization of what we already have: our existing / default thumbnail_target_* would be thumbnail_target_python in this refactored code. Is that correct?

indeed, that could be 1 implementation.

Then perhaps the nbconvert-based "thumbnailing"/running code could live as hopefully simple extensions in https://github.com/sphinx-gallery/sphinx-gallery-extra. WDYT?

That might be a good idea. I'll keep that in mind. My focus is to get something lightweight working, with minimal code and effort.

larsoner · 2018-12-17T16:11:48Z

My focus is to get something lightweight working, with minimal code and effort.

Unfortunately this will probably conflict with the refactoring idea, since I think it would be a bit of effort to get all of the interfaces/APIs right at the Sphinx-gallery end in the first place (i.e., just supporting the existing Python workflows properly). For long-term sustainability it would be nice to avoid people having to fork and/or copy-paste bits of sphinx-gallery to get the functionality that they need. So hopefully we can make it work at some point!

maartenbreddels · 2018-12-17T16:16:26Z

Absolutely agree that would be perfect, but I cannot make promises. The most practical solution for me is I push this to a repo, but not release on PyPI yet, or just deprecate the package once it lands in sge.

choldgraf · 2018-12-27T10:17:55Z

Just a note that this is one reason we created sphinx-gallery-extra...it's a place to play around with useful-but-less-stable code, require more complex dependencies, not support as many older versions of python, etc. Happy to iterate on contributions there @maartenbreddels

larsoner · 2019-05-19T19:10:26Z

@maartenbreddels not sure if you are still working on this or have new ideas, but basically I lay out a plan in #441 (comment) about how we can support other modules. Basically the idea is to keep code for other scrapers in the modules that need scraping as much as possible, and "just" ensure that SG provides the interfaces necessary to do so.

I think this is perhaps the direction we were already headed with the discussion here. If you have an idea for how the APIs of SG would need to be generalized for ipyvolume, let me know. I'm happy to work on a general SG interface that would enable you to code the ipyvolume-specific bits so that people can do gallery_conf['image_scrapers'] = ['ipyvolume'] (or so) to use ipyvolume with sphinx-gallery!

agramfort mentioned this issue Oct 12, 2018

POC: ipyvolume support #420

Open

maartenbreddels mentioned this issue Oct 12, 2018

WIP: Using a notebook + nbconvert to execute code blocks #422

Open

larsoner mentioned this issue May 1, 2019

Should image_scrapers be renamed (to output_scrapers for example)? #485

Closed

lesteve mentioned this issue Jul 9, 2019

ENH Adds HTML visualizations for estimators scikit-learn/scikit-learn#14180

Merged

lucyleeow mentioned this issue Sep 19, 2019

ENH capture 'repr's of last expression #540

Closed

larsoner mentioned this issue Jul 26, 2021

Docs via Sphinx vispy/jupyter_rfb#23

Merged

3 tasks

leouieda mentioned this issue Jan 7, 2022

Add a sphinx docs structure for hosting gmt examples GenericMappingTools/gmt-examples#1

Merged

smarie mentioned this issue Apr 12, 2022

Outsource the "pure python notebook" lib #933

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Executing and capturing rich output using notebook / nbconvert #421

Executing and capturing rich output using notebook / nbconvert #421

maartenbreddels commented Oct 12, 2018

GaelVaroquaux commented Oct 12, 2018 via email

Uh oh!

maartenbreddels commented Oct 12, 2018

Uh oh!

GaelVaroquaux commented Oct 12, 2018 via email

Uh oh!

choldgraf commented Oct 12, 2018

Uh oh!

larsoner commented Dec 14, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

choldgraf commented Dec 27, 2018

Uh oh!

larsoner commented May 19, 2019

Uh oh!

Executing and capturing rich output using notebook / nbconvert #421

Executing and capturing rich output using notebook / nbconvert #421

Comments

maartenbreddels commented Oct 12, 2018

Using the notebook/nbconvert to execute code blocks

Screenshots/thumbnails using headless chrome

Summary

GaelVaroquaux commented Oct 12, 2018 via email

Uh oh!

maartenbreddels commented Oct 12, 2018

Uh oh!

GaelVaroquaux commented Oct 12, 2018 via email

Uh oh!

choldgraf commented Oct 12, 2018

Uh oh!

larsoner commented Dec 14, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

larsoner commented Dec 17, 2018

Uh oh!

maartenbreddels commented Dec 17, 2018

Uh oh!

choldgraf commented Dec 27, 2018

Uh oh!

larsoner commented May 19, 2019

Uh oh!