cceyda

Organizations

@croquiscom @Hugging-Face-Supporter @Hugging-Face-Helping-Hand

✔️ Annotation

Data annotation tools
20 repositories

🖼️ Visualizations

Repos for visualizing, graphing, model interpretability tools... all that good stuff
46 repositories

Adversarial

5 repositories

🗳️ Archived

Projects no longer active

🎨 Color

39 repositories

🛢 Data

Repos of datasets & data wrangling libs
25 repositories

🎁 DBs

27 repositories

🎨 Design

3 repositories

Starred repositories

Python 110 14 Updated Sep 23, 2025

Detection and automatic updating of Korean datasets uploaded to Hugging Face

Python 12 1 Updated Oct 14, 2025

Sample base images for Databricks Container Services

Jupyter Notebook 181 125 Updated Oct 15, 2025

A very fast SIMD-first image comparison library (with nodejs API)

Zig 2,615 95 Updated Oct 22, 2025

Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau

Python 106 12 Updated Oct 14, 2025

HSEB: Hybrid Search Engine Benchmark

Python 12 1 Updated Oct 5, 2025

Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.

Python 47 16 Updated Oct 9, 2025

Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contain…

Python 57 5 Updated Oct 25, 2024

Python project to ship N8N execution data to Langfuse using OTEL API.

Python 3 1 Updated Oct 20, 2025

Ko-SyllaBERT: A Syllable-Based Efficient and Robust Korean Language Model for Real-World Noise and Typographical Errors

Python 3 1 Updated Jun 5, 2025

Biasing the universal tokenizer, and an attempt to optimize compression rates for multilingual text

Python 5 1 Updated Aug 28, 2025

Code for Zero-Shot Tokenizer Transfer

Python 138 12 Updated Jan 14, 2025

A simple tool for adapting a pretrained Hugging Face model to a new vocabulary with (almost) no training.

Python 12 1 Updated Aug 12, 2025

Data models for Hugging Face tokenizers

Python 85 4 Updated Sep 26, 2025

Run any GUI app in the terminal❗

TypeScript 6,685 149 Updated Oct 12, 2025

A massively multilingual modern encoder language model

Python 102 8 Updated Oct 13, 2025

▁▅▆▃▅ Git quick statistics is a simple and efficient way to access various statistics in a Git repository.

Shell 6,822 274 Updated Sep 2, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 16,842 3,968 Updated Jul 21, 2025

Zero-Config Code Flow for Claude Code & Codex

TypeScript 2,890 212 Updated Oct 23, 2025

A collection of public APIs available for Korean services

Python 940 71 Updated Oct 17, 2025

An n8n community node that brings Langfuse observability to your OpenAI chat workflows.

TypeScript 25 1 Updated Sep 24, 2025

Cursor for design - Open Source

TypeScript 5,013 554 Updated Sep 10, 2025

Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX

Python 162 8 Updated Sep 18, 2025

GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction

Python 80 6 Updated Jul 31, 2024

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,606 531 Updated Oct 16, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,601 131 Updated Jan 24, 2025

🌠 Manage your shell commands.

Rust 5,869 139 Updated Sep 5, 2025

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 3,973 199 Updated Oct 22, 2025

PyTorch Extension Library of Optimized Scatter Operations

Python 1,694 200 Updated Aug 12, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 16,508 1,151 Updated Oct 4, 2025