Popular repositories
- lunar-lander-dqn-agent (Public, Python)
  Deep Q-Learning agent for the OpenAI LunarLander-v3 environment.
- Doctor_Chatbot (Public, Python)
  Instruction-tuned LLaMA 2 chatbot fine-tuned with LoRA on real medical Q&A data. Built for conversational health-related queries using Transformers and PEFT.
- pacman-deepq-learning (Public, Python)
  Deep Q-Learning agent trained to play Ms. Pac-Man using a convolutional neural network and experience replay. Built with PyTorch and Gymnasium.
- kungfu-a3c-agent (Public, Python)
  A3C-style parallel actor-critic agent trained to play Kung Fu Master (Atari) using PyTorch and Gymnasium. Includes parallel environments, a shared network, and video playback.