Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented Oct 22, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2511

Note: Links to docs will display an error until the docs builds have been completed.

❌ 9 New Failures, 4 Unrelated Failures

As of commit 6c71b70 with merge base baba52b (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens pushed a commit that referenced this pull request Oct 22, 2024
ghstack-source-id: 2ab8ae3
Pull Request resolved: #2511
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 22, 2024
@vmoens vmoens added Tests Incomplete or broken unit tests CI Has to do with CI setup (e.g. wheels & builds, tests...) labels Oct 22, 2024
@vmoens vmoens merged commit 6c71b70 into gh/vmoens/35/base Oct 22, 2024
50 of 59 checks passed
vmoens pushed a commit that referenced this pull request Oct 22, 2024
ghstack-source-id: 2ab8ae3
Pull Request resolved: #2511
@vmoens vmoens deleted the gh/vmoens/35/head branch October 22, 2024 06:53
@vmoens vmoens changed the title [CI] Fix winndows compile tests [CI] Fix windows compile tests Oct 22, 2024
@github-actions
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}9$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4246s 0.4222s 2.3685 Ops/s 2.2784 Ops/s $\color{#35bf28}+3.95\%$
test_transformed 0.7209s 0.6251s 1.5996 Ops/s 1.6664 Ops/s $\color{#d91a1a}-4.01\%$
test_serial 1.4551s 1.3700s 0.7299 Ops/s 0.7356 Ops/s $\color{#d91a1a}-0.77\%$
test_parallel 1.4432s 1.3396s 0.7465 Ops/s 0.7456 Ops/s $\color{#35bf28}+0.11\%$
test_step_mdp_speed[True-True-True-True-True] 95.2050μs 28.3006μs 35.3349 KOps/s 34.4044 KOps/s $\color{#35bf28}+2.70\%$
test_step_mdp_speed[True-True-True-True-False] 46.4560μs 17.0533μs 58.6396 KOps/s 56.8620 KOps/s $\color{#35bf28}+3.13\%$
test_step_mdp_speed[True-True-True-False-True] 77.3840μs 15.8769μs 62.9844 KOps/s 60.5817 KOps/s $\color{#35bf28}+3.97\%$
test_step_mdp_speed[True-True-True-False-False] 30.4270μs 9.3882μs 106.5164 KOps/s 103.0498 KOps/s $\color{#35bf28}+3.36\%$
test_step_mdp_speed[True-True-False-True-True] 76.7930μs 30.8581μs 32.4064 KOps/s 31.6212 KOps/s $\color{#35bf28}+2.48\%$
test_step_mdp_speed[True-True-False-True-False] 63.4480μs 19.2926μs 51.8334 KOps/s 51.0917 KOps/s $\color{#35bf28}+1.45\%$
test_step_mdp_speed[True-True-False-False-True] 79.0070μs 18.0420μs 55.4263 KOps/s 53.8478 KOps/s $\color{#35bf28}+2.93\%$
test_step_mdp_speed[True-True-False-False-False] 62.8570μs 11.5656μs 86.4636 KOps/s 83.5473 KOps/s $\color{#35bf28}+3.49\%$
test_step_mdp_speed[True-False-True-True-True] 92.4830μs 32.8747μs 30.4185 KOps/s 29.9680 KOps/s $\color{#35bf28}+1.50\%$
test_step_mdp_speed[True-False-True-True-False] 82.4530μs 21.5310μs 46.4446 KOps/s 46.3764 KOps/s $\color{#35bf28}+0.15\%$
test_step_mdp_speed[True-False-True-False-True] 55.6330μs 18.2001μs 54.9448 KOps/s 53.9032 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[True-False-True-False-False] 59.0000μs 11.5814μs 86.3456 KOps/s 84.1387 KOps/s $\color{#35bf28}+2.62\%$
test_step_mdp_speed[True-False-False-True-True] 75.0190μs 35.1159μs 28.4771 KOps/s 27.9065 KOps/s $\color{#35bf28}+2.04\%$
test_step_mdp_speed[True-False-False-True-False] 73.6570μs 23.3752μs 42.7803 KOps/s 42.2845 KOps/s $\color{#35bf28}+1.17\%$
test_step_mdp_speed[True-False-False-False-True] 50.2840μs 20.1122μs 49.7210 KOps/s 48.6376 KOps/s $\color{#35bf28}+2.23\%$
test_step_mdp_speed[True-False-False-False-False] 67.8270μs 13.4275μs 74.4738 KOps/s 71.0634 KOps/s $\color{#35bf28}+4.80\%$
test_step_mdp_speed[False-True-True-True-True] 94.0850μs 33.0516μs 30.2558 KOps/s 29.7160 KOps/s $\color{#35bf28}+1.82\%$
test_step_mdp_speed[False-True-True-True-False] 51.6370μs 21.2420μs 47.0766 KOps/s 45.4075 KOps/s $\color{#35bf28}+3.68\%$
test_step_mdp_speed[False-True-True-False-True] 78.7170μs 21.0079μs 47.6012 KOps/s 45.4025 KOps/s $\color{#35bf28}+4.84\%$
test_step_mdp_speed[False-True-True-False-False] 41.8580μs 13.1509μs 76.0402 KOps/s 72.3874 KOps/s $\textbf{\color{#35bf28}+5.05\%}$
test_step_mdp_speed[False-True-False-True-True] 91.6410μs 34.7922μs 28.7421 KOps/s 29.2415 KOps/s $\color{#d91a1a}-1.71\%$
test_step_mdp_speed[False-True-False-True-False] 58.9000μs 23.5587μs 42.4472 KOps/s 42.6515 KOps/s $\color{#d91a1a}-0.48\%$
test_step_mdp_speed[False-True-False-False-True] 2.7171ms 22.8500μs 43.7636 KOps/s 42.4341 KOps/s $\color{#35bf28}+3.13\%$
test_step_mdp_speed[False-True-False-False-False] 61.2440μs 15.2148μs 65.7256 KOps/s 64.1269 KOps/s $\color{#35bf28}+2.49\%$
test_step_mdp_speed[False-False-True-True-True] 89.4660μs 36.9887μs 27.0352 KOps/s 27.0415 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[False-False-True-True-False] 84.4070μs 25.3446μs 39.4562 KOps/s 38.8092 KOps/s $\color{#35bf28}+1.67\%$
test_step_mdp_speed[False-False-True-False-True] 50.8250μs 23.1831μs 43.1349 KOps/s 42.6734 KOps/s $\color{#35bf28}+1.08\%$
test_step_mdp_speed[False-False-True-False-False] 67.2650μs 15.2271μs 65.6725 KOps/s 64.0670 KOps/s $\color{#35bf28}+2.51\%$
test_step_mdp_speed[False-False-False-True-True] 0.1000ms 38.7230μs 25.8244 KOps/s 25.2402 KOps/s $\color{#35bf28}+2.31\%$
test_step_mdp_speed[False-False-False-True-False] 82.6240μs 27.2564μs 36.6887 KOps/s 36.2994 KOps/s $\color{#35bf28}+1.07\%$
test_step_mdp_speed[False-False-False-False-True] 84.4100μs 24.4162μs 40.9564 KOps/s 39.3922 KOps/s $\color{#35bf28}+3.97\%$
test_step_mdp_speed[False-False-False-False-False] 56.3350μs 16.7817μs 59.5889 KOps/s 56.8032 KOps/s $\color{#35bf28}+4.90\%$
test_values[generalized_advantage_estimate-True-True] 10.9896ms 9.7392ms 102.6783 Ops/s 102.7688 Ops/s $\color{#d91a1a}-0.09\%$
test_values[vec_generalized_advantage_estimate-True-True] 53.4865ms 41.8559ms 23.8915 Ops/s 29.8092 Ops/s $\textbf{\color{#d91a1a}-19.85\%}$
test_values[td0_return_estimate-False-False] 0.2460ms 0.1897ms 5.2717 KOps/s 5.5734 KOps/s $\textbf{\color{#d91a1a}-5.41\%}$
test_values[td1_return_estimate-False-False] 27.5666ms 24.4549ms 40.8915 Ops/s 41.3230 Ops/s $\color{#d91a1a}-1.04\%$
test_values[vec_td1_return_estimate-False-False] 41.4247ms 36.4217ms 27.4561 Ops/s 29.6940 Ops/s $\textbf{\color{#d91a1a}-7.54\%}$
test_values[td_lambda_return_estimate-True-False] 36.3798ms 34.9750ms 28.5918 Ops/s 28.5887 Ops/s $\color{#35bf28}+0.01\%$
test_values[vec_td_lambda_return_estimate-True-False] 38.4052ms 36.5475ms 27.3617 Ops/s 29.7484 Ops/s $\textbf{\color{#d91a1a}-8.02\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 11.3986ms 8.2815ms 120.7517 Ops/s 119.3308 Ops/s $\color{#35bf28}+1.19\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.5555ms 1.9843ms 503.9579 Ops/s 519.1866 Ops/s $\color{#d91a1a}-2.93\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5667ms 0.3629ms 2.7556 KOps/s 2.8255 KOps/s $\color{#d91a1a}-2.47\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 50.5121ms 48.0377ms 20.8170 Ops/s 23.1236 Ops/s $\textbf{\color{#d91a1a}-9.98\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.9996ms 3.0740ms 325.3061 Ops/s 326.0520 Ops/s $\color{#d91a1a}-0.23\%$
test_dqn_speed[False-None] 6.0551ms 1.3859ms 721.5711 Ops/s 725.1467 Ops/s $\color{#d91a1a}-0.49\%$
test_dqn_speed[False-backward] 1.9506ms 1.8425ms 542.7309 Ops/s 535.4416 Ops/s $\color{#35bf28}+1.36\%$
test_dqn_speed[True-None] 0.7368ms 0.4676ms 2.1385 KOps/s 2.1349 KOps/s $\color{#35bf28}+0.17\%$
test_dqn_speed[True-backward] 0.9910ms 0.9040ms 1.1062 KOps/s 1.1010 KOps/s $\color{#35bf28}+0.48\%$
test_dqn_speed[reduce-overhead-None] 0.7182ms 0.4739ms 2.1101 KOps/s 2.1138 KOps/s $\color{#d91a1a}-0.18\%$
test_dqn_speed[reduce-overhead-backward] 0.9512ms 0.8894ms 1.1243 KOps/s 1.1297 KOps/s $\color{#d91a1a}-0.47\%$
test_ddpg_speed[False-None] 3.5097ms 2.8302ms 353.3317 Ops/s 349.8132 Ops/s $\color{#35bf28}+1.01\%$
test_ddpg_speed[False-backward] 4.1909ms 3.9291ms 254.5143 Ops/s 251.3689 Ops/s $\color{#35bf28}+1.25\%$
test_ddpg_speed[True-None] 1.7382ms 1.0165ms 983.7685 Ops/s 985.7464 Ops/s $\color{#d91a1a}-0.20\%$
test_ddpg_speed[True-backward] 2.2048ms 1.9465ms 513.7457 Ops/s 523.6933 Ops/s $\color{#d91a1a}-1.90\%$
test_ddpg_speed[reduce-overhead-None] 1.3396ms 1.0187ms 981.6806 Ops/s 993.6318 Ops/s $\color{#d91a1a}-1.20\%$
test_ddpg_speed[reduce-overhead-backward] 1.9985ms 1.9100ms 523.5594 Ops/s 523.2638 Ops/s $\color{#35bf28}+0.06\%$
test_sac_speed[False-None] 9.1001ms 8.1035ms 123.4041 Ops/s 122.3497 Ops/s $\color{#35bf28}+0.86\%$
test_sac_speed[False-backward] 11.3428ms 10.8449ms 92.2095 Ops/s 91.9784 Ops/s $\color{#35bf28}+0.25\%$
test_sac_speed[True-None] 2.1520ms 1.8622ms 537.0135 Ops/s 532.8788 Ops/s $\color{#35bf28}+0.78\%$
test_sac_speed[True-backward] 3.7204ms 3.5755ms 279.6781 Ops/s 247.2042 Ops/s $\textbf{\color{#35bf28}+13.14\%}$
test_sac_speed[reduce-overhead-None] 2.1515ms 1.8775ms 532.6355 Ops/s 525.0863 Ops/s $\color{#35bf28}+1.44\%$
test_sac_speed[reduce-overhead-backward] 3.6964ms 3.5780ms 279.4862 Ops/s 271.3250 Ops/s $\color{#35bf28}+3.01\%$
test_redq_speed[False-None] 20.4029ms 13.4693ms 74.2429 Ops/s 74.0183 Ops/s $\color{#35bf28}+0.30\%$
test_redq_speed[False-backward] 32.4186ms 23.0016ms 43.4752 Ops/s 43.7563 Ops/s $\color{#d91a1a}-0.64\%$
test_redq_speed[True-None] 6.7176ms 5.0104ms 199.5850 Ops/s 179.0234 Ops/s $\textbf{\color{#35bf28}+11.49\%}$
test_redq_speed[True-backward] 14.8277ms 12.7146ms 78.6497 Ops/s 78.2232 Ops/s $\color{#35bf28}+0.55\%$
test_redq_speed[reduce-overhead-None] 6.3103ms 5.0359ms 198.5726 Ops/s 193.2499 Ops/s $\color{#35bf28}+2.75\%$
test_redq_speed[reduce-overhead-backward] 13.1226ms 12.6592ms 78.9940 Ops/s 79.2001 Ops/s $\color{#d91a1a}-0.26\%$
test_redq_deprec_speed[False-None] 14.9391ms 13.3485ms 74.9150 Ops/s 72.2538 Ops/s $\color{#35bf28}+3.68\%$
test_redq_deprec_speed[False-backward] 19.5807ms 18.7613ms 53.3012 Ops/s 50.1774 Ops/s $\textbf{\color{#35bf28}+6.23\%}$
test_redq_deprec_speed[True-None] 4.3543ms 3.7410ms 267.3097 Ops/s 258.0248 Ops/s $\color{#35bf28}+3.60\%$
test_redq_deprec_speed[True-backward] 8.6867ms 8.2886ms 120.6471 Ops/s 112.5734 Ops/s $\textbf{\color{#35bf28}+7.17\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.5059ms 3.7052ms 269.8940 Ops/s 255.8072 Ops/s $\textbf{\color{#35bf28}+5.51\%}$
test_redq_deprec_speed[reduce-overhead-backward] 8.8054ms 8.3158ms 120.2532 Ops/s 115.0049 Ops/s $\color{#35bf28}+4.56\%$
test_td3_speed[False-None] 8.7915ms 8.1127ms 123.2630 Ops/s 123.9217 Ops/s $\color{#d91a1a}-0.53\%$
test_td3_speed[False-backward] 18.0497ms 10.8290ms 92.3442 Ops/s 94.7501 Ops/s $\color{#d91a1a}-2.54\%$
test_td3_speed[True-None] 2.0655ms 1.7853ms 560.1208 Ops/s 559.9670 Ops/s $\color{#35bf28}+0.03\%$
test_td3_speed[True-backward] 4.1686ms 3.5966ms 278.0442 Ops/s 269.8287 Ops/s $\color{#35bf28}+3.04\%$
test_td3_speed[reduce-overhead-None] 1.8830ms 1.7821ms 561.1390 Ops/s 560.8252 Ops/s $\color{#35bf28}+0.06\%$
test_td3_speed[reduce-overhead-backward] 3.5091ms 3.4006ms 294.0625 Ops/s 288.9318 Ops/s $\color{#35bf28}+1.78\%$
test_cql_speed[False-None] 38.7540ms 36.2566ms 27.5812 Ops/s 27.0159 Ops/s $\color{#35bf28}+2.09\%$
test_cql_speed[False-backward] 0.3322s 53.9254ms 18.5441 Ops/s 21.0372 Ops/s $\textbf{\color{#d91a1a}-11.85\%}$
test_cql_speed[True-None] 17.9486ms 15.9942ms 62.5225 Ops/s 62.1482 Ops/s $\color{#35bf28}+0.60\%$
test_cql_speed[True-backward] 29.4957ms 23.1563ms 43.1849 Ops/s 43.6782 Ops/s $\color{#d91a1a}-1.13\%$
test_cql_speed[reduce-overhead-None] 17.3710ms 15.9921ms 62.5307 Ops/s 62.1911 Ops/s $\color{#35bf28}+0.55\%$
test_cql_speed[reduce-overhead-backward] 24.8246ms 22.7755ms 43.9069 Ops/s 43.4766 Ops/s $\color{#35bf28}+0.99\%$
test_a2c_speed[False-None] 8.7175ms 7.3100ms 136.7994 Ops/s 135.1420 Ops/s $\color{#35bf28}+1.23\%$
test_a2c_speed[False-backward] 18.0238ms 15.1209ms 66.1335 Ops/s 67.7573 Ops/s $\color{#d91a1a}-2.40\%$
test_a2c_speed[True-None] 4.0197ms 3.3447ms 298.9768 Ops/s 293.9224 Ops/s $\color{#35bf28}+1.72\%$
test_a2c_speed[True-backward] 11.4523ms 10.1948ms 98.0895 Ops/s 98.3384 Ops/s $\color{#d91a1a}-0.25\%$
test_a2c_speed[reduce-overhead-None] 3.9377ms 3.3737ms 296.4077 Ops/s 298.3618 Ops/s $\color{#d91a1a}-0.65\%$
test_a2c_speed[reduce-overhead-backward] 10.4754ms 10.0241ms 99.7593 Ops/s 101.1126 Ops/s $\color{#d91a1a}-1.34\%$
test_ppo_speed[False-None] 8.8961ms 7.5848ms 131.8423 Ops/s 131.6502 Ops/s $\color{#35bf28}+0.15\%$
test_ppo_speed[False-backward] 15.6048ms 15.2172ms 65.7152 Ops/s 67.0598 Ops/s $\color{#d91a1a}-2.00\%$
test_ppo_speed[True-None] 4.1120ms 3.7419ms 267.2406 Ops/s 255.4394 Ops/s $\color{#35bf28}+4.62\%$
test_ppo_speed[True-backward] 10.5415ms 9.8432ms 101.5934 Ops/s 98.9611 Ops/s $\color{#35bf28}+2.66\%$
test_ppo_speed[reduce-overhead-None] 4.5249ms 3.7373ms 267.5710 Ops/s 264.5685 Ops/s $\color{#35bf28}+1.13\%$
test_ppo_speed[reduce-overhead-backward] 10.4080ms 9.7363ms 102.7084 Ops/s 96.7938 Ops/s $\textbf{\color{#35bf28}+6.11\%}$
test_reinforce_speed[False-None] 7.7948ms 6.5309ms 153.1182 Ops/s 150.9175 Ops/s $\color{#35bf28}+1.46\%$
test_reinforce_speed[False-backward] 11.3584ms 9.9209ms 100.7973 Ops/s 99.5172 Ops/s $\color{#35bf28}+1.29\%$
test_reinforce_speed[True-None] 3.4823ms 2.6900ms 371.7456 Ops/s 367.5662 Ops/s $\color{#35bf28}+1.14\%$
test_reinforce_speed[True-backward] 11.8006ms 9.0237ms 110.8193 Ops/s 113.1031 Ops/s $\color{#d91a1a}-2.02\%$
test_reinforce_speed[reduce-overhead-None] 3.3343ms 2.6868ms 372.1900 Ops/s 366.5562 Ops/s $\color{#35bf28}+1.54\%$
test_reinforce_speed[reduce-overhead-backward] 10.2065ms 8.8845ms 112.5551 Ops/s 111.9488 Ops/s $\color{#35bf28}+0.54\%$
test_iql_speed[False-None] 33.7518ms 32.3441ms 30.9176 Ops/s 30.0946 Ops/s $\color{#35bf28}+2.73\%$
test_iql_speed[False-backward] 47.3147ms 45.3832ms 22.0346 Ops/s 21.6866 Ops/s $\color{#35bf28}+1.60\%$
test_iql_speed[True-None] 12.2503ms 10.9417ms 91.3933 Ops/s 88.7721 Ops/s $\color{#35bf28}+2.95\%$
test_iql_speed[True-backward] 23.0589ms 22.1063ms 45.2360 Ops/s 43.7377 Ops/s $\color{#35bf28}+3.43\%$
test_iql_speed[reduce-overhead-None] 11.6379ms 10.8853ms 91.8666 Ops/s 88.8799 Ops/s $\color{#35bf28}+3.36\%$
test_iql_speed[reduce-overhead-backward] 23.3194ms 22.4661ms 44.5114 Ops/s 44.2656 Ops/s $\color{#35bf28}+0.56\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5089ms 5.0587ms 197.6797 Ops/s 192.1538 Ops/s $\color{#35bf28}+2.88\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7291ms 0.4925ms 2.0304 KOps/s 2.0432 KOps/s $\color{#d91a1a}-0.62\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7671ms 0.4616ms 2.1664 KOps/s 2.1582 KOps/s $\color{#35bf28}+0.38\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.3172ms 4.9147ms 203.4722 Ops/s 200.3691 Ops/s $\color{#35bf28}+1.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.1763ms 0.4857ms 2.0587 KOps/s 2.0592 KOps/s $\color{#d91a1a}-0.03\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 1.2923ms 0.4762ms 2.0999 KOps/s 2.1666 KOps/s $\color{#d91a1a}-3.08\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 3.0417ms 1.6073ms 622.1620 Ops/s 621.4570 Ops/s $\color{#35bf28}+0.11\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.8472ms 1.5445ms 647.4662 Ops/s 645.3689 Ops/s $\color{#35bf28}+0.32\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.8039ms 4.9699ms 201.2127 Ops/s 196.2712 Ops/s $\color{#35bf28}+2.52\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9830ms 0.6225ms 1.6063 KOps/s 1.5785 KOps/s $\color{#35bf28}+1.76\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.1285ms 0.6077ms 1.6456 KOps/s 1.6725 KOps/s $\color{#d91a1a}-1.61\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.7594ms 4.9382ms 202.5010 Ops/s 200.4922 Ops/s $\color{#35bf28}+1.00\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.2037ms 0.5156ms 1.9393 KOps/s 2.0538 KOps/s $\textbf{\color{#d91a1a}-5.57\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6000ms 0.4635ms 2.1573 KOps/s 2.0782 KOps/s $\color{#35bf28}+3.81\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.4763ms 4.8252ms 207.2461 Ops/s 206.2956 Ops/s $\color{#35bf28}+0.46\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.0495ms 0.4916ms 2.0341 KOps/s 2.0226 KOps/s $\color{#35bf28}+0.57\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7069ms 0.4667ms 2.1428 KOps/s 2.1488 KOps/s $\color{#d91a1a}-0.28\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.6685ms 4.9882ms 200.4719 Ops/s 193.2841 Ops/s $\color{#35bf28}+3.72\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.8053ms 0.6344ms 1.5763 KOps/s 1.5669 KOps/s $\color{#35bf28}+0.60\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8180ms 0.5959ms 1.6782 KOps/s 1.6136 KOps/s $\color{#35bf28}+4.00\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.5784ms 4.2585ms 234.8272 Ops/s 222.8423 Ops/s $\textbf{\color{#35bf28}+5.38\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.7760ms 2.3531ms 424.9670 Ops/s 437.4877 Ops/s $\color{#d91a1a}-2.86\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.2766ms 1.2839ms 778.8871 Ops/s 749.4606 Ops/s $\color{#35bf28}+3.93\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4281s 12.7130ms 78.6598 Ops/s 224.6571 Ops/s $\textbf{\color{#d91a1a}-64.99\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.3250ms 2.3594ms 423.8361 Ops/s 435.4836 Ops/s $\color{#d91a1a}-2.67\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.9430ms 1.2595ms 793.9550 Ops/s 727.9041 Ops/s $\textbf{\color{#35bf28}+9.07\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.0930ms 4.4680ms 223.8155 Ops/s 214.3720 Ops/s $\color{#35bf28}+4.41\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.3674ms 2.4432ms 409.3043 Ops/s 421.6996 Ops/s $\color{#d91a1a}-2.94\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.5006ms 1.5094ms 662.5174 Ops/s 700.5298 Ops/s $\textbf{\color{#d91a1a}-5.43\%}$

@github-actions
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7196s 0.7192s 1.3904 Ops/s 1.3891 Ops/s $\color{#35bf28}+0.09\%$
test_transformed 1.0509s 0.9766s 1.0240 Ops/s 1.0463 Ops/s $\color{#d91a1a}-2.13\%$
test_serial 2.1894s 2.1106s 0.4738 Ops/s 0.4789 Ops/s $\color{#d91a1a}-1.07\%$
test_parallel 2.1404s 2.0482s 0.4882 Ops/s 0.4971 Ops/s $\color{#d91a1a}-1.78\%$
test_step_mdp_speed[True-True-True-True-True] 0.1816ms 39.2884μs 25.4528 KOps/s 25.7202 KOps/s $\color{#d91a1a}-1.04\%$
test_step_mdp_speed[True-True-True-True-False] 47.2220μs 23.2446μs 43.0207 KOps/s 43.0495 KOps/s $\color{#d91a1a}-0.07\%$
test_step_mdp_speed[True-True-True-False-True] 63.8930μs 21.4235μs 46.6777 KOps/s 46.9296 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[True-True-True-False-False] 39.9920μs 12.4508μs 80.3158 KOps/s 79.7943 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[True-True-False-True-True] 71.9540μs 42.2521μs 23.6674 KOps/s 23.7665 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[True-True-False-True-False] 61.1130μs 25.8020μs 38.7567 KOps/s 38.9059 KOps/s $\color{#d91a1a}-0.38\%$
test_step_mdp_speed[True-True-False-False-True] 52.5930μs 24.3004μs 41.1516 KOps/s 41.7896 KOps/s $\color{#d91a1a}-1.53\%$
test_step_mdp_speed[True-True-False-False-False] 48.8730μs 15.2058μs 65.7645 KOps/s 65.0887 KOps/s $\color{#35bf28}+1.04\%$
test_step_mdp_speed[True-False-True-True-True] 68.5730μs 44.9040μs 22.2697 KOps/s 22.7092 KOps/s $\color{#d91a1a}-1.94\%$
test_step_mdp_speed[True-False-True-True-False] 57.1620μs 28.6201μs 34.9405 KOps/s 35.0079 KOps/s $\color{#d91a1a}-0.19\%$
test_step_mdp_speed[True-False-True-False-True] 53.2220μs 24.2038μs 41.3159 KOps/s 41.6258 KOps/s $\color{#d91a1a}-0.74\%$
test_step_mdp_speed[True-False-True-False-False] 39.7020μs 15.0077μs 66.6326 KOps/s 66.5727 KOps/s $\color{#35bf28}+0.09\%$
test_step_mdp_speed[True-False-False-True-True] 76.6740μs 47.1375μs 21.2145 KOps/s 21.4758 KOps/s $\color{#d91a1a}-1.22\%$
test_step_mdp_speed[True-False-False-True-False] 96.5750μs 30.4657μs 32.8238 KOps/s 32.4194 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[True-False-False-False-True] 58.6830μs 26.6631μs 37.5051 KOps/s 37.8619 KOps/s $\color{#d91a1a}-0.94\%$
test_step_mdp_speed[True-False-False-False-False] 48.0620μs 17.8385μs 56.0584 KOps/s 55.9527 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[False-True-True-True-True] 0.1008ms 44.0843μs 22.6838 KOps/s 22.5634 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[False-True-True-True-False] 60.2330μs 28.6476μs 34.9069 KOps/s 35.2075 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[False-True-True-False-True] 57.6330μs 28.5741μs 34.9967 KOps/s 33.9738 KOps/s $\color{#35bf28}+3.01\%$
test_step_mdp_speed[False-True-True-False-False] 44.2720μs 17.4442μs 57.3257 KOps/s 55.4850 KOps/s $\color{#35bf28}+3.32\%$
test_step_mdp_speed[False-True-False-True-True] 82.3940μs 47.8612μs 20.8937 KOps/s 20.9161 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[False-True-False-True-False] 67.6330μs 31.2973μs 31.9517 KOps/s 31.9100 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[False-True-False-False-True] 3.2909ms 31.5506μs 31.6952 KOps/s 31.9576 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[False-True-False-False-False] 44.8020μs 20.2593μs 49.3600 KOps/s 49.3209 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[False-False-True-True-True] 82.9650μs 49.9542μs 20.0183 KOps/s 19.8589 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[False-False-True-True-False] 61.1630μs 33.6265μs 29.7385 KOps/s 29.5648 KOps/s $\color{#35bf28}+0.59\%$
test_step_mdp_speed[False-False-True-False-True] 62.6030μs 32.2011μs 31.0548 KOps/s 32.2027 KOps/s $\color{#d91a1a}-3.56\%$
test_step_mdp_speed[False-False-True-False-False] 49.0830μs 20.4487μs 48.9030 KOps/s 49.0135 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-False-False-True-True] 85.5540μs 51.6912μs 19.3457 KOps/s 19.2967 KOps/s $\color{#35bf28}+0.25\%$
test_step_mdp_speed[False-False-False-True-False] 69.0830μs 36.1451μs 27.6662 KOps/s 27.8798 KOps/s $\color{#d91a1a}-0.77\%$
test_step_mdp_speed[False-False-False-False-True] 55.9820μs 33.3494μs 29.9855 KOps/s 29.5716 KOps/s $\color{#35bf28}+1.40\%$
test_step_mdp_speed[False-False-False-False-False] 54.1730μs 22.8833μs 43.7000 KOps/s 44.4720 KOps/s $\color{#d91a1a}-1.74\%$
test_values[generalized_advantage_estimate-True-True] 24.3824ms 23.6007ms 42.3715 Ops/s 41.9631 Ops/s $\color{#35bf28}+0.97\%$
test_values[vec_generalized_advantage_estimate-True-True] 93.6725ms 2.7429ms 364.5782 Ops/s 357.2061 Ops/s $\color{#35bf28}+2.06\%$
test_values[td0_return_estimate-False-False] 82.7440μs 62.6851μs 15.9527 KOps/s 15.3617 KOps/s $\color{#35bf28}+3.85\%$
test_values[td1_return_estimate-False-False] 53.0789ms 52.5244ms 19.0388 Ops/s 18.8373 Ops/s $\color{#35bf28}+1.07\%$
test_values[vec_td1_return_estimate-False-False] 1.3090ms 1.0497ms 952.6092 Ops/s 952.1151 Ops/s $\color{#35bf28}+0.05\%$
test_values[td_lambda_return_estimate-True-False] 83.8790ms 83.1147ms 12.0316 Ops/s 11.5132 Ops/s $\color{#35bf28}+4.50\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3275ms 1.0468ms 955.2675 Ops/s 949.9064 Ops/s $\color{#35bf28}+0.56\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.8056ms 24.0223ms 41.6280 Ops/s 39.1028 Ops/s $\textbf{\color{#35bf28}+6.46\%}$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 0.9969ms 0.7147ms 1.3992 KOps/s 1.3968 KOps/s $\color{#35bf28}+0.17\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7171ms 0.6360ms 1.5723 KOps/s 1.5570 KOps/s $\color{#35bf28}+0.98\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.4809ms 1.4491ms 690.0809 Ops/s 688.5035 Ops/s $\color{#35bf28}+0.23\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.6919ms 0.6502ms 1.5379 KOps/s 1.5271 KOps/s $\color{#35bf28}+0.71\%$
test_dqn_speed[False-None] 6.6490ms 1.2990ms 769.8208 Ops/s 758.0189 Ops/s $\color{#35bf28}+1.56\%$
test_dqn_speed[False-backward] 1.9382ms 1.8248ms 548.0104 Ops/s 542.2139 Ops/s $\color{#35bf28}+1.07\%$
test_dqn_speed[True-None] 1.1706ms 0.5773ms 1.7322 KOps/s 1.8093 KOps/s $\color{#d91a1a}-4.26\%$
test_dqn_speed[True-backward] 1.5185ms 1.1928ms 838.3689 Ops/s 911.0420 Ops/s $\textbf{\color{#d91a1a}-7.98\%}$
test_dqn_speed[reduce-overhead-None] 0.9327ms 0.5644ms 1.7717 KOps/s 1.7358 KOps/s $\color{#35bf28}+2.07\%$
test_dqn_speed[reduce-overhead-backward] 1.0718ms 0.9947ms 1.0053 KOps/s 977.8660 Ops/s $\color{#35bf28}+2.80\%$
test_ddpg_speed[False-None] 2.9223ms 2.6864ms 372.2391 Ops/s 368.6385 Ops/s $\color{#35bf28}+0.98\%$
test_ddpg_speed[False-backward] 3.9313ms 3.8465ms 259.9763 Ops/s 257.7526 Ops/s $\color{#35bf28}+0.86\%$
test_ddpg_speed[True-None] 1.3016ms 1.2324ms 811.4198 Ops/s 794.9210 Ops/s $\color{#35bf28}+2.08\%$
test_ddpg_speed[True-backward] 2.2261ms 2.1771ms 459.3230 Ops/s 412.6902 Ops/s $\textbf{\color{#35bf28}+11.30\%}$
test_ddpg_speed[reduce-overhead-None] 1.5798ms 1.2523ms 798.5302 Ops/s 798.8116 Ops/s $\color{#d91a1a}-0.04\%$
test_ddpg_speed[reduce-overhead-backward] 2.2435ms 2.1937ms 455.8542 Ops/s 454.0432 Ops/s $\color{#35bf28}+0.40\%$
test_sac_speed[False-None] 8.0433ms 7.4460ms 134.3001 Ops/s 132.7131 Ops/s $\color{#35bf28}+1.20\%$
test_sac_speed[False-backward] 11.0157ms 10.5147ms 95.1046 Ops/s 94.4015 Ops/s $\color{#35bf28}+0.74\%$
test_sac_speed[True-None] 2.3910ms 2.0174ms 495.6923 Ops/s 490.1966 Ops/s $\color{#35bf28}+1.12\%$
test_sac_speed[True-backward] 4.2852ms 3.9162ms 255.3508 Ops/s 222.4287 Ops/s $\textbf{\color{#35bf28}+14.80\%}$
test_sac_speed[reduce-overhead-None] 2.3693ms 2.0195ms 495.1649 Ops/s 484.8210 Ops/s $\color{#35bf28}+2.13\%$
test_sac_speed[reduce-overhead-backward] 3.9928ms 3.9101ms 255.7452 Ops/s 253.1944 Ops/s $\color{#35bf28}+1.01\%$
test_redq_speed[False-None] 14.6828ms 10.0530ms 99.4725 Ops/s 99.5768 Ops/s $\color{#d91a1a}-0.10\%$
test_redq_speed[False-backward] 17.3583ms 16.6604ms 60.0226 Ops/s 60.0770 Ops/s $\color{#d91a1a}-0.09\%$
test_redq_speed[True-None] 3.7614ms 3.5281ms 283.4414 Ops/s 284.8301 Ops/s $\color{#d91a1a}-0.49\%$
test_redq_speed[True-backward] 8.8259ms 8.4617ms 118.1793 Ops/s 118.5804 Ops/s $\color{#d91a1a}-0.34\%$
test_redq_speed[reduce-overhead-None] 3.8717ms 3.4642ms 288.6711 Ops/s 284.0154 Ops/s $\color{#35bf28}+1.64\%$
test_redq_speed[reduce-overhead-backward] 8.8283ms 8.4824ms 117.8907 Ops/s 119.3478 Ops/s $\color{#d91a1a}-1.22\%$
test_redq_deprec_speed[False-None] 10.8292ms 10.4418ms 95.7687 Ops/s 96.3872 Ops/s $\color{#d91a1a}-0.64\%$
test_redq_deprec_speed[False-backward] 15.5903ms 15.1234ms 66.1225 Ops/s 67.3092 Ops/s $\color{#d91a1a}-1.76\%$
test_redq_deprec_speed[True-None] 3.5603ms 3.2130ms 311.2372 Ops/s 300.7175 Ops/s $\color{#35bf28}+3.50\%$
test_redq_deprec_speed[True-backward] 7.8270ms 7.0198ms 142.4547 Ops/s 140.7557 Ops/s $\color{#35bf28}+1.21\%$
test_redq_deprec_speed[reduce-overhead-None] 3.3668ms 3.1984ms 312.6592 Ops/s 303.8319 Ops/s $\color{#35bf28}+2.91\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.2035ms 7.0083ms 142.6880 Ops/s 140.1459 Ops/s $\color{#35bf28}+1.81\%$
test_td3_speed[False-None] 7.5925ms 7.3908ms 135.3029 Ops/s 132.4589 Ops/s $\color{#35bf28}+2.15\%$
test_td3_speed[False-backward] 10.3543ms 10.1553ms 98.4710 Ops/s 96.6295 Ops/s $\color{#35bf28}+1.91\%$
test_td3_speed[True-None] 1.9899ms 1.8913ms 528.7376 Ops/s 515.8884 Ops/s $\color{#35bf28}+2.49\%$
test_td3_speed[True-backward] 3.7923ms 3.6654ms 272.8223 Ops/s 220.4612 Ops/s $\textbf{\color{#35bf28}+23.75\%}$
test_td3_speed[reduce-overhead-None] 1.9054ms 1.8817ms 531.4474 Ops/s 527.7487 Ops/s $\color{#35bf28}+0.70\%$
test_td3_speed[reduce-overhead-backward] 3.7821ms 3.6842ms 271.4302 Ops/s 271.3239 Ops/s $\color{#35bf28}+0.04\%$
test_cql_speed[False-None] 27.4775ms 24.5423ms 40.7459 Ops/s 40.9170 Ops/s $\color{#d91a1a}-0.42\%$
test_cql_speed[False-backward] 38.7951ms 34.3154ms 29.1414 Ops/s 29.6919 Ops/s $\color{#d91a1a}-1.85\%$
test_cql_speed[True-None] 11.1403ms 10.8296ms 92.3396 Ops/s 93.5935 Ops/s $\color{#d91a1a}-1.34\%$
test_cql_speed[True-backward] 16.8875ms 16.6322ms 60.1243 Ops/s 61.0109 Ops/s $\color{#d91a1a}-1.45\%$
test_cql_speed[reduce-overhead-None] 11.4994ms 10.8906ms 91.8220 Ops/s 94.0530 Ops/s $\color{#d91a1a}-2.37\%$
test_cql_speed[reduce-overhead-backward] 17.1649ms 16.6273ms 60.1420 Ops/s 61.2755 Ops/s $\color{#d91a1a}-1.85\%$
test_a2c_speed[False-None] 5.8459ms 5.2501ms 190.4719 Ops/s 192.0875 Ops/s $\color{#d91a1a}-0.84\%$
test_a2c_speed[False-backward] 12.7993ms 11.5684ms 86.4423 Ops/s 86.8246 Ops/s $\color{#d91a1a}-0.44\%$
test_a2c_speed[True-None] 3.3461ms 3.0101ms 332.2126 Ops/s 328.5256 Ops/s $\color{#35bf28}+1.12\%$
test_a2c_speed[True-backward] 8.7282ms 8.5135ms 117.4606 Ops/s 116.3671 Ops/s $\color{#35bf28}+0.94\%$
test_a2c_speed[reduce-overhead-None] 3.1848ms 3.0409ms 328.8544 Ops/s 326.7876 Ops/s $\color{#35bf28}+0.63\%$
test_a2c_speed[reduce-overhead-backward] 8.8854ms 8.3935ms 119.1393 Ops/s 104.5063 Ops/s $\textbf{\color{#35bf28}+14.00\%}$
test_ppo_speed[False-None] 7.3343ms 5.4657ms 182.9588 Ops/s 179.8795 Ops/s $\color{#35bf28}+1.71\%$
test_ppo_speed[False-backward] 12.2637ms 11.8968ms 84.0560 Ops/s 83.9504 Ops/s $\color{#35bf28}+0.13\%$
test_ppo_speed[True-None] 3.7317ms 3.4584ms 289.1511 Ops/s 285.9591 Ops/s $\color{#35bf28}+1.12\%$
test_ppo_speed[True-backward] 8.3677ms 8.2315ms 121.4847 Ops/s 123.0402 Ops/s $\color{#d91a1a}-1.26\%$
test_ppo_speed[reduce-overhead-None] 3.7845ms 3.4082ms 293.4130 Ops/s 288.7676 Ops/s $\color{#35bf28}+1.61\%$
test_ppo_speed[reduce-overhead-backward] 8.3326ms 8.1288ms 123.0194 Ops/s 121.3992 Ops/s $\color{#35bf28}+1.33\%$
test_reinforce_speed[False-None] 4.6977ms 4.3236ms 231.2911 Ops/s 225.4900 Ops/s $\color{#35bf28}+2.57\%$
test_reinforce_speed[False-backward] 7.4003ms 7.1167ms 140.5149 Ops/s 140.4518 Ops/s $\color{#35bf28}+0.04\%$
test_reinforce_speed[True-None] 2.5703ms 2.1994ms 454.6689 Ops/s 433.7722 Ops/s $\color{#35bf28}+4.82\%$
test_reinforce_speed[True-backward] 7.4930ms 7.0014ms 142.8285 Ops/s 142.7222 Ops/s $\color{#35bf28}+0.07\%$
test_reinforce_speed[reduce-overhead-None] 2.6151ms 2.2029ms 453.9422 Ops/s 448.0480 Ops/s $\color{#35bf28}+1.32\%$
test_reinforce_speed[reduce-overhead-backward] 7.1894ms 7.0370ms 142.1057 Ops/s 141.5807 Ops/s $\color{#35bf28}+0.37\%$
test_iql_speed[False-None] 21.4572ms 19.1562ms 52.2024 Ops/s 50.9575 Ops/s $\color{#35bf28}+2.44\%$
test_iql_speed[False-backward] 30.1352ms 29.5884ms 33.7970 Ops/s 33.3650 Ops/s $\color{#35bf28}+1.29\%$
test_iql_speed[True-None] 8.8154ms 6.8396ms 146.2084 Ops/s 147.5920 Ops/s $\color{#d91a1a}-0.94\%$
test_iql_speed[True-backward] 15.6726ms 15.2702ms 65.4870 Ops/s 62.3277 Ops/s $\textbf{\color{#35bf28}+5.07\%}$
test_iql_speed[reduce-overhead-None] 7.1529ms 6.7414ms 148.3364 Ops/s 147.7770 Ops/s $\color{#35bf28}+0.38\%$
test_iql_speed[reduce-overhead-backward] 15.6751ms 15.3005ms 65.3575 Ops/s 63.5346 Ops/s $\color{#35bf28}+2.87\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5288ms 6.3502ms 157.4765 Ops/s 157.8092 Ops/s $\color{#d91a1a}-0.21\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.2992s 0.4446ms 2.2494 KOps/s 3.6578 KOps/s $\textbf{\color{#d91a1a}-38.50\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5007ms 0.2568ms 3.8943 KOps/s 3.6933 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3984ms 6.1815ms 161.7721 Ops/s 166.3042 Ops/s $\color{#d91a1a}-2.73\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8007ms 0.2319ms 4.3125 KOps/s 2.9084 KOps/s $\textbf{\color{#35bf28}+48.28\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4944ms 0.2658ms 3.7619 KOps/s 3.5361 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.4410ms 1.2144ms 823.4186 Ops/s 744.9448 Ops/s $\textbf{\color{#35bf28}+10.53\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3726ms 1.1571ms 864.2004 Ops/s 765.1860 Ops/s $\textbf{\color{#35bf28}+12.94\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4565ms 6.3384ms 157.7681 Ops/s 160.1609 Ops/s $\color{#d91a1a}-1.49\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.2307ms 0.3756ms 2.6625 KOps/s 2.2587 KOps/s $\textbf{\color{#35bf28}+17.88\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6719ms 0.4028ms 2.4825 KOps/s 2.2040 KOps/s $\textbf{\color{#35bf28}+12.64\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4195ms 6.2375ms 160.3216 Ops/s 165.7774 Ops/s $\color{#d91a1a}-3.29\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.8059ms 0.3166ms 3.1588 KOps/s 2.9821 KOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6296ms 0.3449ms 2.8992 KOps/s 4.7013 KOps/s $\textbf{\color{#d91a1a}-38.33\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 9.7031ms 6.1832ms 161.7273 Ops/s 167.9606 Ops/s $\color{#d91a1a}-3.71\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9269ms 0.2605ms 3.8394 KOps/s 3.5033 KOps/s $\textbf{\color{#35bf28}+9.59\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4358ms 0.2499ms 4.0023 KOps/s 3.9242 KOps/s $\color{#35bf28}+1.99\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5309ms 6.3592ms 157.2520 Ops/s 157.8292 Ops/s $\color{#d91a1a}-0.37\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.6012ms 0.4155ms 2.4067 KOps/s 2.2972 KOps/s $\color{#35bf28}+4.77\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7381ms 0.4119ms 2.4277 KOps/s 2.4451 KOps/s $\color{#d91a1a}-0.71\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.7700ms 5.2044ms 192.1467 Ops/s 188.4277 Ops/s $\color{#35bf28}+1.97\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 6.9708ms 2.0110ms 497.2712 Ops/s 493.6257 Ops/s $\color{#35bf28}+0.74\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.6045ms 1.2260ms 815.6763 Ops/s 825.7148 Ops/s $\color{#d91a1a}-1.22\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4105s 13.3837ms 74.7178 Ops/s 186.2069 Ops/s $\textbf{\color{#d91a1a}-59.87\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.8279ms 2.0048ms 498.8113 Ops/s 490.5046 Ops/s $\color{#35bf28}+1.69\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.3592ms 1.2180ms 821.0289 Ops/s 815.7058 Ops/s $\color{#35bf28}+0.65\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.1000ms 5.4628ms 183.0574 Ops/s 180.9981 Ops/s $\color{#35bf28}+1.14\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.9667ms 2.0840ms 479.8489 Ops/s 477.9396 Ops/s $\color{#35bf28}+0.40\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.0758ms 1.3551ms 737.9682 Ops/s 724.2975 Ops/s $\color{#35bf28}+1.89\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Tests Incomplete or broken unit tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants