Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@AlexGuteniev
Copy link
Contributor

AVX2 masks only

Before

---------------------------------------------------------------
Benchmark                     Time             CPU   Iterations
---------------------------------------------------------------
r<std::uint32_t>           2005 ns         1086 ns      1280000
r<std::uint64_t>           2107 ns         1247 ns       814545

After

---------------------------------------------------------------
Benchmark                     Time             CPU   Iterations
---------------------------------------------------------------
r<std::uint32_t>            173 ns         73.8 ns     19063830
r<std::uint64_t>            441 ns          178 ns      7466667

@AlexGuteniev AlexGuteniev requested a review from a team as a code owner April 2, 2024 11:48
@StephanTLavavej StephanTLavavej added the performance Must go faster label Apr 2, 2024
@StephanTLavavej

This comment was marked as resolved.

@AlexGuteniev

This comment was marked as resolved.

@StephanTLavavej StephanTLavavej self-assigned this Apr 2, 2024
@AlexGuteniev AlexGuteniev changed the title vectorize replace vectorize replace 🎭 Apr 2, 2024
@StephanTLavavej
Copy link
Member

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

@StephanTLavavej StephanTLavavej merged commit bed2673 into microsoft:main Apr 9, 2024
@StephanTLavavej
Copy link
Member

Thanks for massively improving performance here! 🚀 😸 🐈

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

performance Must go faster

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants