Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ishine's full-sized avatar
  • gerzz.inc
  • shanghai

Block or report ishine

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SpikeMamba presents a novel integration of spiking neural networks (SNNs) with the Mamba state space model architecture, investigating the potential for biologically-inspired temporal dynamics in l…

Python 3 Updated Sep 9, 2025

Resources to develop programming and software development skills

HTML 28 11 Updated Sep 21, 2023

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 23,079 3,534 Updated Oct 22, 2025

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 29,224 2,582 Updated Oct 20, 2025

Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"

Python 10 1 Updated Oct 9, 2025

[ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"

Python 330 9 Updated Jun 27, 2025

Towards Fine-grained Audio Captioning with Multimodal Contextual Cues

Python 81 5 Updated Sep 29, 2025

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,557 461 Updated Oct 17, 2025

[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"

Python 80 10 Updated Jan 17, 2025
Python 6 1 Updated May 30, 2025

Repository of ACL2023 paper: Unbalanced Optimal Transport for Unbalanced Word Alignment

Python 38 5 Updated Sep 13, 2023

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

152 3 Updated Jun 13, 2024

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,630 74 Updated Apr 18, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 4,790 669 Updated Aug 11, 2025

My clone repository

1 Updated Sep 2, 2025

A list of publicly available room impulse response datasets and scripts to download them.

Shell 514 46 Updated Oct 11, 2025

Multi-lingual AudioCaps

11 Updated Nov 20, 2023

s1: Simple test-time scaling

Python 6,581 766 Updated Jun 25, 2025

The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1000 hours of audio speech from indigenous speakers of the language. Of which 10…

HTML 9 4 Updated May 2, 2025

A free, licensed, and industrial animation dataset

69 5 Updated Jun 26, 2024

Python tool for converting files and office documents to Markdown.

Python 82,032 4,591 Updated Oct 20, 2025

[TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.

Python 144 7 Updated Jun 7, 2024

Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech

Python 30 4 Updated Sep 18, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,973 476 Updated Mar 18, 2025

Transpiler of Python to many other languages

Python 1,041 67 Updated Sep 9, 2025

Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

Python 7 1 Updated Sep 25, 2024

A Chinese Expressive Long-dialogue Speech Dataset with Scripts

Python 20 3 Updated Nov 11, 2024

A piano music dataset with Audio, Symbolic and Text labels

Python 33 Updated Mar 6, 2025

A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwise (training) and stepwise (sampling).

Python 44 7 Updated Aug 1, 2025
Next