-
Carnegie Mellon University
- Pittsburgh
Stars
An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natural language prompts.
Infrastructure for researching self-driving databases
An open-source framework for training large multimodal models.
Apache Pinot - A realtime distributed OLAP datastore
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Beancount: Double-Entry Accounting from Text Files.
Making large AI models cheaper, faster and more accessible
FoundationDB - the open source, distributed, transactional key-value store
Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretation based) engines and compiling engines.
A composable and fully extensible C++ execution engine library for data management systems.
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
A curated list of engineering blogs
[VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"
Incomplete Redis client and server implementation using Tokio - for learning purposes only
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
official code for "Large Language Models as Optimizers"
A simple chatbot frontend made using bootstrap and jquery
The open source, pluggable, nosql benchmarking suite.
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
The BusTub Relational Database Management System (Educational)