
Pull requests: AI-Hypercomputer/maxtext

Pull requests list

Snehalv dsv4 muon
#4065 opened Jun 4, 2026 by snehalv2002 Collaborator Draft
4 tasks
Add elastic training (Pathways + xpk) guide under docs/
#4063 opened Jun 4, 2026 by inardini
[WIP] Ragged kernel updates
#4061 opened Jun 4, 2026 by RissyRan Collaborator Draft
4 tasks
Measure routing mismatch for Qwen models
#4060 opened Jun 3, 2026 by xuefgu Collaborator Draft
4 tasks done
Fix cuDNN SDPA autoregressive inference when KV cache batch differs from decode batch
#4058 opened Jun 3, 2026 by sfvaroglu Contributor
4 tasks done
Remove hardcoded vision hyperparams from Qwen mm preprocessor
#4057 opened Jun 3, 2026 by hengtaoguo Collaborator
4 tasks done
Fix sparse distillation loss and speed up teacher top-k logit saving (labels: pull ready)
#4056 opened Jun 3, 2026 by ajkv-google Collaborator
4 tasks done
Update vllm/tpu-inference commit and fix vllm installation
#4054 opened Jun 3, 2026 by SurbhiJainUSC Collaborator Draft
4 tasks done
Enable Gemma 4 E2B / E4B inference via vLLM RPA (labels: gemini-review)
#4053 opened Jun 3, 2026 by gagika Collaborator Draft
4 tasks done
Support of gdn kernel from tpu-inference (labels: gemini-review)
#4051 opened Jun 3, 2026 by khatwanimohit Collaborator
4 tasks done
[RL] Honor tokenizer chat templates for base models that lack one
#4049 opened Jun 3, 2026 by dasoto Collaborator
4 tasks done
[WIP-exp1] microsoft/Phi-4-mini-instruct
#4047 opened Jun 3, 2026 by hengtaoguo Collaborator Draft
4 tasks
fix: raise RuntimeError when checkpoint step >= config.steps (labels: bug, gemini-review, pull ready)
#4046 opened Jun 2, 2026 by Dr-Left Collaborator
4 tasks done
Add intermediate eval hook: fire evaluate() every eval_interval outer steps
#4044 opened Jun 2, 2026 by py4 Collaborator
4 tasks done
[Qwen3.5] Add moe weight sync script for 35b model (labels: gemini-review, pull ready)
#4041 opened Jun 2, 2026 by Rohan-Bierneni Collaborator
4 tasks done
Support Qwix quantization on NNX
#4040 opened Jun 2, 2026 by hsuan-lun-chiang Collaborator
4 tasks done
update rope_max_timescale to 1M for qwen3-30b-a3b-base to match HF (labels: gemini-review)
#4039 opened Jun 2, 2026 by JamesDeng42 Collaborator
4 tasks done
Add MoE router similarity and expert fraction distillation metrics
#4037 opened Jun 1, 2026 by JamesDeng42 Collaborator
4 tasks done
Update gather reduce kernel aligning with tpu inference
#4035 opened Jun 1, 2026 by NuojCheng Collaborator Draft
4 tasks