I'm a PyTorch user, and I've always thought that torch.Tensor objects could be displayed in a more useful way. All a torch.Tensor needs is a call to '.numpy()' to be converted to a NumPy array.
So, would it be possible to apply simple conversion scripts to data types the Variable Explorer doesn't support? (i.e. if the Variable Explorer sees a Tensor-type object, it applies the corresponding function/method for that type and shows the result in the Variable Explorer with a distinguishable color.) For example:
import torch

def type_conversion(data):
    # Convert a PyTorch tensor to a NumPy array; other types are left alone
    if isinstance(data, torch.Tensor):
        return data.numpy()
    return None
IPython >=4.0 : 6.3.1 (OK)
cython >=0.21 : 0.28.2 (OK)
jedi >=0.9.0 : 0.12.0 (OK)
nbconvert >=4.0 : 5.3.1 (OK)
numpy >=1.7 : 1.14.1 (OK)
pandas >=0.13.1 : 0.22.0 (OK)
psutil >=0.3 : 5.4.5 (OK)
pycodestyle >=2.3: 2.4.0 (OK)
pyflakes >=0.6.0 : 1.6.0 (OK)
pygments >=2.0 : 2.2.0 (OK)
pylint >=0.25 : 1.8.4 (OK)
qtconsole >=4.2.0: 4.3.1 (OK)
rope >=0.9.4 : 0.10.7 (OK)
sphinx >=0.6.6 : 1.7.4 (OK)
sympy >=0.7.3 : 1.1.1 (OK)
Thanks for reporting. We'll take a look at this for Spyder 4.
To unpack this a tad: the proximate issue is support for PyTorch tensors in the Variable Explorer, which could be solved with a few straightforward changes to the editor selection in CollectionsEditor or within ArrayEditor. However, the broader feature request the user was making, reflected in the original title rather than the modified one, was the ability for users to add support for custom types to the Variable Explorer via simple conversion scripts to known types for which we already offer editors. Not sure the latter is realistic or feasible, but I wanted to make sure we were all clear on what specifically the user was asking for.
Thanks for your reply!
What I need is support for the torch.Tensor type.
However, I suggested the general solution because I wasn't sure whether it's okay to add just one exceptional case for the specific torch.Tensor type. Hmm... could someone tell me which one is the more 'Spyder' way?
@ccordoba12 would be the one to ultimately decide that; based on his re-titling of your issue, it seems he's focused more on the former. However, the latter could always be added at some point in the (likely more distant) future, in the form of some kind of modular "plugin" support for the Variable Explorer's Editor classes. It would be really neat to have, but would take some work, requiring refactoring the existing Editors into a modular system, creating an API to interact with third-party editors, etc.
As somewhat of a stopgap between the quick-but-specific and general-but-more-involved approaches, we could use Pythonic duck typing: convert unknown types to known ones (principally array, and maybe list) and fall back to the current behavior if an exception is raised, something like the sketch below. I'm not sure it would be worth the tradeoffs vs. checking against a list of types that can be converted to array, list, etc. by various means, though.
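Roughly, such a duck-typed fallback could look something like this (illustrative only, not actual Spyder code):

import numpy as np

def coerce_to_known_type(value):
    """Duck-typed conversion of an unknown object to a known editable
    type, regressing to the object itself if nothing fits."""
    try:
        if hasattr(value, "numpy"):       # e.g. torch.Tensor
            return value.numpy()
        if hasattr(value, "__array__"):   # generic array-likes
            return np.asarray(value)
        if hasattr(value, "__iter__"):    # plain iterables
            return list(value)
    except Exception:
        pass
    return value  # current behavior: show the raw object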
The more custom types we supported, though, the more we'd want the option to view any object as a generic low-level __dict__, to see and interact with all its properties, alongside the higher-level view we currently use when available. I've been thinking about something like this for a while, since the former can sometimes be convenient, especially when working with derived types and convertible ones like this. @ccordoba12, thoughts?
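For illustration only (this is just a minimal stand-in, not proposed Spyder code), such a low-level view essentially amounts to enumerating the object's attribute dict:

def generic_view(obj):
    # Map each attribute in the object's __dict__ (empty for objects
    # without one) to the type name of its value
    return {name: type(value).__name__
            for name, value in getattr(obj, "__dict__", {}).items()}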
The narrow issue is similar to, perhaps the same as, #5375.
I think there should be an issue about making it easier to support new data types, with how to achieve this to be discussed.
The narrow issue is similar to, perhaps the same as, #5375.
They're related, and both fit under the broader issue of PyTorch tensor support, but they're not the same (narrowly speaking, at least). Fixing this one (displaying tensors as arrays with ArrayEditor rather than as generic objects) would work around the other one, but not the converse; the latter would be a very simple fix along the lines of #6284 and a few similar issues I've fixed, while the former would be a tad more involved (though not that much more, at least in theory).
I think there should be an issue about making it easier to support new data types, with how to achieve this to be discussed.
Indeed, agreed on my part. @ccordoba12?
I don't think it's necessary to wait for ccordoba12 to give permission to open a new issue.
Sorry :(
Thanks for opening it.
Now I am using Spyder 4.0.0b1, but the problem still exists :(
@donglee-afar We've been busy implementing the overhauled completion/introspection/analysis architecture and the new debugger for the next betas, but more modular support for new Variable Explorer datatypes is something we'd like to add in the future.
In the meantime, though, it should actually not be too difficult to add support for viewing and even editing PyTorch tensors as NumPy arrays, so long as the output arrays are <4D (same as with NumPy arrays). We already do something similar for Image here; you would just need to add a block below that one following the same outline, except replacing arr = array(value) with arr = value.numpy(). You wouldn't even need a conv_func, as a PyTorch tensor and its NumPy array counterpart share the same data locations in memory, so they update automatically without having to convert them back.
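A hedged sketch of what that extra branch could look like, following the Image outline described above; the function name and structure here are approximations, not the exact Spyder source:

import numpy as np

try:
    import torch
except ImportError:
    torch = None

def get_array_for_editor(value):
    """Return the array that ArrayEditor should display for `value`."""
    if torch is not None and isinstance(value, torch.Tensor):
        # .numpy() shares memory with the tensor, so edits made in the
        # editor propagate back automatically, with no conv_func needed
        # (works for CPU tensors; CUDA tensors would need .cpu() first)
        return value.numpy()
    # Existing Image-style handling: copy into a fresh array
    return np.array(value)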
Other than that, there's only one complexity: you'll need to add a short try-except block here in Spyder-Kernels trying to import tensor from torch, and making it a FakeObject if the import fails. Finally, in CollectionsEditor, you'll need to add tensor to the imports from spyder_kernels.utils.nsview.
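A sketch of that guarded import, assuming it lives next to the other optional imports in spyder_kernels/utils/nsview.py (note that PyTorch's tensor class is spelled torch.Tensor):

class FakeObject:
    """Stand-in for spyder-kernels' placeholder for missing optional
    dependencies; defined here only to make the sketch self-contained."""

try:
    from torch import Tensor
except ImportError:
    # PyTorch not installed: isinstance checks against Tensor will
    # simply never match
    Tensor = FakeObject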
After you test this and ensure it's working with dev versions of Spyder-Kernels and Spyder, there's only one other problem: it will make Spyder crash if it doesn't have the latest dev version of Spyder-Kernels (once your changes are merged). To fix that, when you add some backwards compatibility at the end, you can add a try-except around the torch import from spyder-kernels and set torch = FakeObject there (or you could do the same import check there and actually enable the feature if it succeeds, but that gets a little messy). Do that change in a separate commit so it can easily be reverted once the new Spyder-Kernels is out and widely available. Thanks!
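A minimal sketch of that backwards-compatibility guard on the Spyder side; the exact import path and name are assumptions:

try:
    # Newer spyder-kernels exposes the tensor class for the
    # Variable Explorer's type checks
    from spyder_kernels.utils.nsview import Tensor
except ImportError:
    # Older spyder-kernels without tensor support: fall back to the
    # placeholder so Spyder doesn't crash
    from spyder_kernels.utils.nsview import FakeObject as Tensor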
Thanks for the detailed guide!
I had a quick look and it basically works :) But it also requires an additional check in arrayeditor.py at line 620 to avoid setting the writeable flag to True. It is already True, and setting it (again?) caused the error "cannot set WRITEABLE flag to True of this array".
This happens when calling the editor setup along the lines of editor.setup_and_check(value.numpy(), title=key, readonly=readonly).
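A hedged sketch of that guard; the surrounding context in arrayeditor.py is paraphrased, not quoted:

def ensure_writeable(arr, readonly=False):
    # NumPy raises "cannot set WRITEABLE flag to True of this array"
    # when (re)setting the flag on views that don't own their data,
    # such as the array returned by Tensor.numpy(), so only touch the
    # flag when it actually needs to change
    if not readonly and not arr.flags.writeable:
        arr.flags.writeable = True
    return arr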
Anyway, I'm not very familiar with PyTorch, just TensorFlow, so I don't know about the side effects if PyTorch is basically always imported at kernel startup! For TensorFlow that could cause some headaches, since it would e.g. prevent hiding graphics cards (via CUDA_VISIBLE_DEVICES) before the import, etc. Furthermore, for TensorFlow the initial import can sometimes take quite long; as I said, I don't have the equivalent experience with PyTorch.
So I guess this feature should probably require activation through a button somewhere in a menu...?
However, I do have another suggestion that could improve daily work for ML developers in TensorFlow and PyTorch without needing to import these libraries at startup. Most of the time, I as an ML developer would already be happy to see basic info about my tensors, like dtypes and shapes, inside the Variable Explorer. Of course an array editor would be even better, but let's say that's step 2.
So I did a quick experiment, expanding the get_size(item) function inside nsview.py with an extra elif that checks whether the item has a shape attribute. If its string representation matches a known pattern, that shape is returned:
if hasattr(item, "shape"):
    try:
        shape_str_repr = repr(item.shape)
        if shape_str_repr.startswith('TensorShape(['):
            # TensorFlow: e.g. TensorShape([None, 28, 28, 1])
            return str(item.shape)
        elif shape_str_repr.startswith("torch.Size(["):
            # PyTorch: strip the wrapper, e.g. torch.Size([3, 4]) -> (3, 4)
            return "({:s})".format(shape_str_repr[12:-2])
    except Exception:
        return 1  # fall back to get_size's default scalar size
I know this is hacky, but it could be a good help for daily work, and in the worst case it just falls back to the current status quo.
Is something like this of interest?
(If yes, let me know more details: shall I convert the output back to a tuple of numbers, or is a string okay? Just for info, TensorFlow has "None" shapes as well, so (None, None, 1) is a valid shape there.)
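For reference, both frameworks' shape objects convert to a plain tuple directly, which would preserve TensorFlow's None dimensions; a quick illustration, assuming both libraries are available:

import torch
import tensorflow as tf

# torch.Size is a tuple subclass, so conversion is direct:
tuple(torch.Size([3, 4]))                         # -> (3, 4)

# TensorShape.as_list() preserves unknown (None) dimensions:
tuple(tf.TensorShape([None, None, 1]).as_list())  # -> (None, None, 1)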
Hello, I am a newcomer to PyTorch. I tried to view a tensor variable in the Spyder IDE and to follow what @CAM-Gerlach described, but I still don't know how to solve it. Could I get more detailed steps or a walkthrough?
Thanks!