This repository is the official implementation of our paper "Preserving Fairness Generalization in Deepfake Detection", which has been accepted by CVPR 2024.

Python 59 4 Updated May 31, 2024

efeslab / LiteASR

[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Python 132 5 Updated May 18, 2025

crate / crate

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Java 4,319 581 Updated Oct 30, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,508 1,983 Updated Oct 27, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,017 3,168 Updated Nov 1, 2025

seanghay / khmercut

A (fast) Khmer word segmentation toolkit.

Python 10 6 Updated Feb 10, 2025

founded-labs / react-native-reusables

Bringing shadcn/ui to React Native. Beautifully crafted components with Nativewind, open source, and almost as easy to use.

TypeScript 7,271 267 Updated Oct 10, 2025

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,433 196 Updated May 14, 2025

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 324,994 52,975 Updated Nov 1, 2025

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,861 1,187 Updated Sep 21, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,981 695 Updated Feb 10, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 775 67 Updated Aug 27, 2025

MetythornPenn / youtube2mp3

Script to download YouTube videos and convert them to MP3 format using multiple CPU cores

Python 1 Updated Jun 8, 2024

MetythornPenn / youtube2mp3-node

JavaScript 1 Updated Jun 10, 2024

MetythornPenn / neovim-config

All plugins and configurations of neovim to supercharge vim user

Lua 1 Updated Jul 5, 2024

MetythornPenn / dev-environment-files

Forked from josean-dev/dev-environment-files

Lua 1 Updated Aug 21, 2024

MetythornPenn / ollama-docker

Running ollama with docker

Python 1 Updated Oct 14, 2024

metythorn MetythornPenn

Lists (19)

acleda

AI

ASR

backend

Data Engineer

Data Scientist

ecommerce

face_recognition

frontend

khmerllm

khmernlp

mobile

OCR

Recommendation System

server

Sigmoid_learning

sigmoid-shop

Tools

TTS

Stars