metal : support argsort for ne00 > 1024 (#17247)

* metal : refactor argsort
* cont : sort chunks
* cont : merge sorted buckets
* cont : cleanup
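
The "sort chunks" and "merge sorted buckets" steps suggest a two-phase argsort for rows longer than a single chunk. The following is a minimal CPU-side sketch of that general idea in C++, not the Metal kernel from this change. The function name `argsort_chunked` and the `chunk` parameter are hypothetical; `chunk` stands in for whatever per-threadgroup limit applies (1024 per the commit title), and ascending order is assumed.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical reference: argsort one row of x in ascending order.
// `chunk` is an illustrative stand-in for the per-chunk size limit.
std::vector<int32_t> argsort_chunked(const std::vector<float> & x, size_t chunk) {
    assert(chunk > 0);
    const size_t n = x.size();

    // Start from the identity permutation 0, 1, ..., n-1.
    std::vector<int32_t> idx(n);
    std::iota(idx.begin(), idx.end(), 0);

    // Compare indices by the values they point to.
    auto less = [&](int32_t a, int32_t b) { return x[a] < x[b]; };

    // Phase 1: sort each chunk of indices independently.
    for (size_t i = 0; i < n; i += chunk) {
        const size_t end = std::min(i + chunk, n);
        std::stable_sort(idx.begin() + i, idx.begin() + end, less);
    }

    // Phase 2: merge neighbouring sorted runs, doubling the run length per pass.
    std::vector<int32_t> tmp(n);
    for (size_t len = chunk; len < n; len *= 2) {
        for (size_t i = 0; i < n; i += 2*len) {
            const size_t mid = std::min(i + len,   n);
            const size_t end = std::min(i + 2*len, n);
            std::merge(idx.begin() + i,   idx.begin() + mid,
                       idx.begin() + mid, idx.begin() + end,
                       tmp.begin() + i, less);
        }
        idx.swap(tmp);
    }

    return idx;
}
```

With `chunk = 3` and a 10-element row, phase 1 produces four sorted runs and phase 2 needs two merge passes (run lengths 3, then 6) to produce the full ordering.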