(feat): use `np.zeros` for buffer creation with `fill_value=0` #3082

ilan-gold · 2025-05-22T10:01:01Z

I was profiling code and noticed a non-trivial amount of time spent on np.full with a fill_value=0 so I did a little digging and at least on my machine:

In [1]: import numpy as np

In [2]: %timeit np.full((10_000, 10_000), dtype=np.float32, fill_value=0, order="C")
33.2 ms ± 784 μs per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [3]: %timeit np.zeros((10_000, 10_000), dtype=np.float32, order="C")
1.04 μs ± 1.48 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each)

For example:

zarr-python/src/zarr/core/array.py

Lines 1285 to 1290 in 481550a

    
           out_buffer = prototype.nd_buffer.create( 
        
               shape=indexer.shape, 
        
               dtype=out_dtype, 
        
               order=self.order, 
        
               fill_value=self.metadata.fill_value, 
        
           )

and

zarr-python/src/zarr/codecs/sharding.py

Lines 441 to 443 in 481550a

    
           out = chunk_spec.prototype.nd_buffer.create( 
        
               shape=shard_shape, dtype=shard_spec.dtype, order=shard_spec.order, fill_value=0 
        
           )

are both points where this happens

TODO:

Add unit tests and/or doctests in docstrings
Add docstrings and API docs for any new/modified user-facing classes and functions
New/modified features documented in docs/user-guide/*.rst
Changes documented as a new file in changes/
GitHub Actions have all passed
Test coverage is 100% (Codecov passes)

d-v-b · 2025-05-22T10:10:52Z

out of curiosity, is there any performance difference between np.zeros(...)[:] = fill_value and np.full(..., fill_value)? I would expect not but I have no idea.

src/zarr/core/buffer/cpu.py

ilan-gold · 2025-05-22T10:22:18Z

Not much :/

In [1]: import numpy as np
In [2]: %timeit d = np.zeros((10_000, 10_000), dtype=np.float32, order="C"); d[:] = 1
35.1 ms ± 934 μs per loop (mean ± std. dev. of 7 runs, 10 loops each)
In [3]: %timeit np.full((10_000, 10_000), dtype=np.float32, fill_value=1, order="C")
37.7 ms ± 2.15 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

ilan-gold · 2025-05-22T10:48:40Z

@d-v-b I think the failing test is unrelated to this PR. I'm also seeing it locally on main

d-v-b · 2025-06-30T09:39:26Z

@meeseeksdev backport to 3.0.9

lumberbot-app · 2025-06-30T09:39:31Z

Can't Dooooo.... It seem like this is already backported (commit is empty).I won't do anything. MrMeeseeks out.

(feat): use np.zeros for buffer creation when possible

9eac892

github-actions bot added the needs release notes Automatically applied to PRs which haven't added release notes label May 22, 2025

ilan-gold changed the title ~~(feat): use np.zeros for buffer creation fill_value=0~~ (feat): use np.zeros for buffer creation with fill_value=0 May 22, 2025

d-v-b reviewed May 22, 2025

View reviewed changes

src/zarr/core/buffer/cpu.py Outdated Show resolved Hide resolved

d-v-b added the performance Potential issues with Zarr performance (I/O, memory, etc.) label May 22, 2025

(fix): add comment and check

29508cf

ilan-gold force-pushed the ig/fill_value_0 branch from a11f515 to 29508cf Compare May 22, 2025 10:25

(chore): relnote

b0f815f

github-actions bot removed the needs release notes Automatically applied to PRs which haven't added release notes label May 22, 2025

dstansby approved these changes May 23, 2025

View reviewed changes

dstansby merged commit f674236 into zarr-developers:main May 23, 2025
30 checks passed

ilan-gold deleted the ig/fill_value_0 branch May 23, 2025 11:07

d-v-b added this to the 3.0.9 milestone Jun 30, 2025

d-v-b removed this from the 3.0.9 milestone Jun 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

(feat): use `np.zeros` for buffer creation with `fill_value=0` #3082

(feat): use `np.zeros` for buffer creation with `fill_value=0` #3082

Uh oh!

ilan-gold commented May 22, 2025 •

edited

Loading

Uh oh!

d-v-b commented May 22, 2025

Uh oh!

Uh oh!

ilan-gold commented May 22, 2025

Uh oh!

ilan-gold commented May 22, 2025

Uh oh!

Uh oh!

d-v-b commented Jun 30, 2025

Uh oh!

lumberbot-app bot commented Jun 30, 2025

Uh oh!

Uh oh!

	out_buffer = prototype.nd_buffer.create(
	shape=indexer.shape,
	dtype=out_dtype,
	order=self.order,
	fill_value=self.metadata.fill_value,
	)

	out = chunk_spec.prototype.nd_buffer.create(
	shape=shard_shape, dtype=shard_spec.dtype, order=shard_spec.order, fill_value=0
	)

Uh oh!

(feat): use np.zeros for buffer creation with fill_value=0 #3082

(feat): use np.zeros for buffer creation with fill_value=0 #3082

Uh oh!

Conversation

ilan-gold commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

d-v-b commented May 22, 2025

Uh oh!

Uh oh!

ilan-gold commented May 22, 2025

Uh oh!

ilan-gold commented May 22, 2025

Uh oh!

Uh oh!

d-v-b commented Jun 30, 2025

Uh oh!

lumberbot-app bot commented Jun 30, 2025

Uh oh!

Uh oh!

(feat): use `np.zeros` for buffer creation with `fill_value=0` #3082

(feat): use `np.zeros` for buffer creation with `fill_value=0` #3082

ilan-gold commented May 22, 2025 •

edited

Loading