Jax: slogdet timing differences

Created on 23 Dec 2019 · 6Comments · Source: google/jax

Hello!

I was wondering why there is a timing difference (gpu) between jit(slogdet) and the plain slogdet expression by one order of magnitude.

From the source code, it seems that slogdet is decorated with jit already link to code.

Therefore, I would have expected no difference. Though, this is what I get.

import jax.numpy as np
from jax import random, jit

key = random.PRNGKey(0)
a = random.normal(key, (2, 2))

jit_slogdet = jit(np.linalg.slogdet)
slogdet = np.linalg.slogdet

# initial run
jit_slogdet(a)
slogdet(a)

# timeit
%timeit -n 10 jit_slogdet(a)[1].block_until_ready()
%timeit -n 10 slogdet(a)[1].block_until_ready()

Output

148 µs ± 16.4 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
68.6 ms ± 437 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

What am I missing?

jax-version: 0.1.55

Cheers
Christian

bug

Source

reziproke87

Most helpful comment

My guess is that this eval_jaxpr in the custom_transforms logic is getting cache misses every time, for the reason outlined in #1829. However, I had to roll back #1829 because of one internal test failure (not a JAX test) that I didn't understand.

My long-promised still-vaporware rewrite of custom_transforms would attempt to avoid eval_jaxpr, so two fix options for this issue are (1) try to roll-forward #1829, (2) just wait for a custom_transforms rewrite that fixes all problems and brings about world peace.

mattjj on 24 Dec 2019

😄2

All 6 comments

It looks like using custom_tranforms in addition to jit is slowing down raw slogdet. @mattjj is this an issue you're already aware of?

skye on 23 Dec 2019

I wasn’t aware of this but it sounds plausible. We need to rewrite custom_transforms and this will go on the list of fixes. I might not have time until after the holidays though.

Thanks for raising this!

mattjj on 23 Dec 2019

Yeah, it looks like custom_transforms needs to be inside a jit to avoid recompilation. With slogdet, the custom_transforms decorator is outside jit, so the inner jit is re-traced into a new xla_call in a new jaxpr each time slogdet is called, and that new xla_call becomes a new XLA compilation.

jekbradbury on 23 Dec 2019

For now probably best to use another jit like you have in your code. You can also use

from jax import config
config.FLAGS.jax_log_compiles=True

to see when JAX is unexpectedly compiling things.

jekbradbury on 23 Dec 2019

👍1

mattjj on 24 Dec 2019

😄2

I think this is fixed now the custom_transforms rewrite has landed!

On a V100 GPU, I get:

10 loops, best of 3: 182 µs per loop
10 loops, best of 3: 289 µs per loop

hawkinsp on 16 Apr 2020

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Gradient of `np.exp` sometimes causes invalid values

DylanMuir · 3Comments

Jaxify numpy function

sursu · 3Comments

jax tensor and numpy array convertion

yfji · 3Comments

`is` keyword is not preserved under vmap.

sschoenholz · 3Comments

Scalars passed into np.array should return 0-dim arrays

alexbw · 3Comments