Stars
FlagGems is an operator library for large language models implemented in the Triton Language.
LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers
Use the TPC-DS benchmark to test Spark SQL performance
The official GitHub page for the survey paper "A Survey of Large Language Models".
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
An industrial deep learning framework for high-dimension sparse data
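CN-CppUserGroup-2019-1, lock-free queue demo
Ring-Log is an efficient and concise C++ asynchronous logger. It is fast (supports at least 1.25 million log writes per second) and easy to extend, and is especially suited to scenarios that write logs frequently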
bigo-sg / brpc
Forked from apache/brpc
Most common RPC framework used throughout Baidu, with 600,000+ instances and 500+ kinds of services, called "baidu-rpc" inside Baidu.
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory (OSDI 2018, TOS 2019)
Scripts and tools for troubleshooting and performance analysis in Linux. This includes dynamic tracing scripts with SystemTap both for system calls and for userspace function tracing.
A Prometheus exporter which uses eBPF to measure block IO request latency / size
The health-check tool monitors processes in various ways to help identify areas where they are consuming too many resources. One can trace one or more processes (including all their threads and child …
Powerstat measures the power consumption of a machine using the battery stats or the Intel RAPL interface. The output is like vmstat but also shows power consumption statistics. At the end of a run…
High-performance cross-platform inference engine; you can run Anakin on x86-cpu, arm, nv-gpu, amd-gpu, bitmain and cambricon devices.