-
Notifications
You must be signed in to change notification settings - Fork 527
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Migrate distillation MaxTextCheckpointManager to reflect Tunix updated Checkpoint Manager.
#4064
opened Jun 4, 2026 by
copybara-service
Bot
Loading…
Add elastic training (Pathways + xpk) guide under docs/
#4063
opened Jun 4, 2026 by
inardini
Loading…
Fix deprecated setup.sh and setup_post_training_requirements.sh documentation references
pull ready
#4059
opened Jun 3, 2026 by
bvandermoon
Collaborator
Loading…
4 tasks done
Fix cuDNN SDPA autoregressive inference when KV cache batch differs from decode batch
#4058
opened Jun 3, 2026 by
sfvaroglu
Contributor
Loading…
4 tasks done
Remove hardcoded vision hyperparams from Qwen mm preprocessor
#4057
opened Jun 3, 2026 by
hengtaoguo
Collaborator
Loading…
4 tasks done
Fix sparse distillation loss and speed up teacher top-k logit saving
pull ready
#4056
opened Jun 3, 2026 by
ajkv-google
Collaborator
Loading…
4 tasks done
[pallas:sc] Remove
use_tc_tiling_on_sc=True, because this is now a default
#4055
opened Jun 3, 2026 by
copybara-service
Bot
Loading…
Update vllm/tpu-inference commit and fix vllm installation
#4054
opened Jun 3, 2026 by
SurbhiJainUSC
Collaborator
•
Draft
4 tasks done
Qwen3 Coder 480B inference sharding changes for MaxText.
#4052
opened Jun 3, 2026 by
copybara-service
Bot
Loading…
Support of gdn kernel from tpu-inference
gemini-review
#4051
opened Jun 3, 2026 by
khatwanimohit
Collaborator
Loading…
4 tasks done
[RL] Honor tokenizer chat templates for base models that lack one
#4049
opened Jun 3, 2026 by
dasoto
Collaborator
Loading…
4 tasks done
[WIP-exp1] microsoft/Phi-4-mini-instruct
#4047
opened Jun 3, 2026 by
hengtaoguo
Collaborator
•
Draft
4 tasks
fix: raise RuntimeError when checkpoint step >= config.steps
bug
Something isn't working
gemini-review
pull ready
#4046
opened Jun 2, 2026 by
Dr-Left
Collaborator
Loading…
4 tasks done
Add intermediate eval hook: fire evaluate() every eval_interval outer steps
#4044
opened Jun 2, 2026 by
py4
Collaborator
Loading…
4 tasks done
[Qwen3.5] Add moe weight sync script for 35b model
gemini-review
pull ready
#4041
opened Jun 2, 2026 by
Rohan-Bierneni
Collaborator
Loading…
4 tasks done
Support Qwix quantization on NNX
#4040
opened Jun 2, 2026 by
hsuan-lun-chiang
Collaborator
Loading…
4 tasks done
update rope_max_timescale to 1M for qwen3-30b-a3b-base to match HF
gemini-review
#4039
opened Jun 2, 2026 by
JamesDeng42
Collaborator
Loading…
4 tasks done
[NNX] NNX migration (12/N): delete Linen code paths, classes, and NNX compatibility flags
#4038
opened Jun 2, 2026 by
ecnal-cienet
Collaborator
•
Draft
4 tasks done
Add MoE router similarity and expert fraction distillation metrics
#4037
opened Jun 1, 2026 by
JamesDeng42
Collaborator
Loading…
4 tasks done
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.