Fixed issue with batching and file handling logic #42
I had to really push things before I saw issues. For me, concurrency was never the problem; my issue was that when I pushed things far enough, I ran out of RAM and the program crashed.
 Finished updating database index settings
100%|██████████| 10/10 [00:27<00:00,  2.77s/it]
Handling file chunks: 1it [00:27, 27.66s/it]
Bulk index took 27.7018 seconds
Finished running benchmarks
With  My machine
    
Could you try  I think it makes more sense to keep batch sizes fairly low but process more files concurrently in a real situation. I'm still a bit surprised that the results are OS-specific, though. Do you think the difference between 16 GB of RAM on my end and 32 GB on yours accounts for all of it, or is there an OS angle?
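As a rough illustration of the "small batches, more files at once" idea, here is a minimal sketch. It is not the code from this PR: `index_batch`, `load_docs`, and the parameter names are hypothetical stand-ins, and the real indexing call would go where `index_batch` is.

```python
import asyncio
from typing import Callable, Iterator


def make_batches(docs: list, batch_size: int) -> Iterator[list]:
    """Yield consecutive slices of at most batch_size documents."""
    for start in range(0, len(docs), batch_size):
        yield docs[start:start + batch_size]


async def index_batch(batch: list) -> int:
    """Hypothetical stand-in for the real indexing request."""
    await asyncio.sleep(0)  # pretend to wait on the server
    return len(batch)


async def index_file(path: str, load_docs: Callable[[str], list],
                     batch_size: int, limiter: asyncio.Semaphore) -> int:
    # The file is only loaded once a slot is free, so at most
    # max_concurrent_files files are held in memory at a time.
    async with limiter:
        docs = load_docs(path)
        indexed = 0
        for batch in make_batches(docs, batch_size):
            indexed += await index_batch(batch)
        return indexed


async def index_files(paths: list, load_docs: Callable[[str], list],
                      batch_size: int = 1000,
                      max_concurrent_files: int = 4) -> int:
    limiter = asyncio.Semaphore(max_concurrent_files)
    counts = await asyncio.gather(
        *(index_file(p, load_docs, batch_size, limiter) for p in paths)
    )
    return sum(counts)
```

With this shape, memory use should scale with `max_concurrent_files` times the size of one file rather than with the total number of files, so the two knobs can be tuned separately on a 16 GB and a 32 GB machine.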
    
With Mac being Unix under the hood, it seems unlikely to me that the OS is the difference. I had htop running during my runs and could see the exact point where my RAM maxed out, and the program crashed there every time. Maybe you could try that and see whether your RAM is maxed out when the error is returned. My program would crash altogether without giving me any error, but maybe Mac handles that better and gives you the error instead.
 Finished updating database index settings
100%|██████████| 5/5 [00:13<00:00,  2.70s/it]
100%|██████████| 5/5 [00:14<00:00,  2.92s/it]
100%|██████████| 5/5 [00:13<00:00,  2.69s/it]
100%|██████████| 5/5 [00:13<00:00,  2.67s/it]
100%|██████████| 5/5 [00:13<00:00,  2.70s/it]
100%|██████████| 5/5 [00:13<00:00,  2.67s/it]
100%|██████████| 5/5 [00:13<00:00,  2.65s/it]
100%|██████████| 5/5 [00:13<00:00,  2.69s/it]
100%|██████████| 5/5 [00:13<00:00,  2.66s/it]
100%|██████████| 5/5 [00:13<00:00,  2.63s/it]
100%|██████████| 5/5 [00:13<00:00,  2.64s/it]
100%|██████████| 5/5 [00:13<00:00,  2.66s/it]
100%|██████████| 5/5 [00:13<00:00,  2.65s/it]
100%|██████████| 5/5 [00:13<00:00,  2.65s/it]
100%|██████████| 5/5 [00:13<00:00,  2.62s/it]
100%|██████████| 5/5 [00:13<00:00,  2.63s/it]
100%|██████████| 5/5 [00:13<00:00,  2.64s/it]
100%|██████████| 5/5 [00:13<00:00,  2.63s/it]
100%|██████████| 5/5 [00:13<00:00,  2.62s/it]
100%|██████████| 5/5 [00:13<00:00,  2.64s/it]
Handling file chunks: 100%|██████████| 20/20 [04:26<00:00, 13.34s/it]
Bulk index took 266.8564 seconds
Finished running benchmarks
 Finished updating database index settings
100%|██████████| 20/20 [00:55<00:00,  2.76s/it]
100%|██████████| 20/20 [00:54<00:00,  2.74s/it]
100%|██████████| 20/20 [00:53<00:00,  2.69s/it]
100%|██████████| 20/20 [00:54<00:00,  2.70s/it]
100%|██████████| 20/20 [00:53<00:00,  2.68s/it]
Handling file chunks: 100%|██████████| 5/5 [04:31<00:00, 54.29s/it]
Bulk index took 271.4923 seconds
Finished running benchmarks
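If watching htop by hand is awkward, the RAM check suggested above could also be logged from inside the script. This is only a sketch under a few assumptions: it uses the standard-library `resource` module (Unix only), `run_with_memory_log` and `handle_batch` are hypothetical names, and it reports the peak memory of the Python process only, so memory used by a separately running search server would not show up here.

```python
import resource
import sys
from typing import Callable, Iterable


def peak_rss_mib() -> float:
    """Peak resident set size of this process so far, in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def run_with_memory_log(batches: Iterable[list],
                        handle_batch: Callable[[list], None]) -> list:
    """Process each batch and record the peak RSS after it finishes."""
    peaks = []
    for number, batch in enumerate(batches, start=1):
        handle_batch(batch)
        peak = peak_rss_mib()
        peaks.append(peak)
        print(f"batch {number}: peak RSS so far {peak:.1f} MiB")
    return peaks
```

If the last value printed before a crash is close to the machine's total RAM, that would point at memory rather than the OS as the cause.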
    
A detailed description of the issue and the fixes is given in the meilisearch_python_async repo. In a nutshell:
I'm not sure whether this same workflow is required on Ubuntu, so I would appreciate it if you could take a look and see whether a large batch size plus more files being processed concurrently is possible on Ubuntu, @sanders41. Thanks!