falloficarus22/README.md

Hi, I'm Abhishek Shinde

AI Engineer focused on AI Agents, Deep Learning, and NLP.
Self-taught, building robust end-to-end AI systems for fun.


Focus Areas

  • AI Agent architectures & tool-using systems
  • Deep Learning with PyTorch
  • NLP model development & modern transformer techniques
  • Reinforcement Learning algorithms

Projects

BitGrad

A custom deep learning optimizer inspired by biological learning and gradient-based updates.
Designed to experiment with alternative optimization behaviors beyond Adam/SGD.
Inspired by Karpathy's micrograd.
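
For readers unfamiliar with micrograd, the core idea is a scalar value that remembers how it was computed, so gradients can be propagated backwards through the graph. The sketch below is illustrative only and is not BitGrad's actual code; the `Value` class and its attribute names are assumptions chosen for this example.

```python
class Value:
    """A scalar that records its parents so gradients can flow backwards."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically order the graph, then apply the chain rule in reverse.
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x, y = Value(2.0), Value(3.0)
z = x * y + x
z.backward()
```

After `z.backward()`, `x.grad` holds dz/dx = y + 1 and `y.grad` holds dz/dy = x.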

PPO Trading Agent

A reinforcement learning agent trained using Proximal Policy Optimization for algorithmic trading.
Focuses on stable returns, reward engineering, environment design, and policy performance monitoring.
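
As a rough illustration of what Proximal Policy Optimization optimizes, here is a minimal sketch of the standard clipped surrogate loss in plain Python. It is not taken from the trading agent's code; the function name `ppo_clip_loss`, its arguments, and the default `clip_eps=0.2` are assumptions for this example.

```python
import math


def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate loss over a batch (hypothetical helper).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. advantages: advantage estimates.
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)  # pi_new(a|s) / pi_old(a|s)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped objectives.
        total += min(ratio * adv, clipped * adv)
    # Negate because optimizers minimize, while PPO maximizes the surrogate.
    return -total / len(advantages)
```

The clipping removes any incentive to move the probability ratio beyond `1 ± clip_eps` in the direction that would improve the objective, which helps keep policy updates stable.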

Echelon (WIP)

An AI-powered chess engine, combining:

  • NNUE-based evaluation
  • Search-based planning
  • Agentic decision loops

A personal research project to explore competitive game-playing AI.

Tech Stack

Languages & Core Tools

  • Python, Linux, Git/GitHub
  • Docker, FastAPI
  • Experiment tracking (Weights & Biases)
  • Google Colab / Kaggle workflows
  • Model deployment fundamentals

Deep Learning

  • PyTorch, NumPy, Pandas, Scikit-Learn, Matplotlib/Seaborn
  • Transformer models, attention mechanisms
  • CNNs, RNNs, sequence models
  • Optimization algorithms (including custom ones like BitGrad)
  • Reinforcement Learning (PPO, DQN fundamentals)

NLP

  • Hugging Face transformers, SpaCy
  • Tokenization, embeddings, sequence modeling
  • LLM fine-tuning & prompt engineering
  • Text classification, generation, and evaluation

Currently Learning

  • Advanced AI Agent frameworks & multi-agent systems
  • Scaling laws, evals, and reliability for AI systems
  • Production-grade NLP pipelines
  • Chess engine search optimization (Echelon)

Contact

Open to collaboration, contributions, and interesting discussions in AI.
Let’s build something impactful. Email: [email protected]

Pinned

  1. bitgrad (Public, Python)

    A tiny autograd + neural network + visualization library built from scratch. Inspired by Karpathy’s micrograd, extended with a visualization engine and CLI.

  2. PPO-Trading-Agent (Public, Python)

    Deep reinforcement learning trading bot for crypto using Proximal Policy Optimization and order block analysis.