Kwai-Klear
Popular repositories
- KlearReasoner Public
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
- CE-GPPO Public Forked from Kwai-Klear/KlearReasoner
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
Python 13
Repositories
- mini-swe-agent-plus Public Forked from SWE-agent/mini-swe-agent
mini-swe-agent-plus: a tiny (~100 LOC) GitHub issue fixer—now with a robust multi-line text edit tool.
- Klear-AgentForge Public
- CE-GPPO Public Forked from Kwai-Klear/KlearReasoner
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
- KlearReasoner Public
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
- Leanabell-Prover-V2 Public Forked from Leanabell-LM/Leanabell-Prover-V2
Verifier-integrated reasoning for formal theorem proving via RL.
- Klear-Qwen3-Thinking-Preview Public
A practical tutorial on how to effectively use RL to enhance reasoning capabilities on the Qwen3-8B model.
People
This organization has no public members.