Jefersonalves

♟️

Jeferson Alves Jefersonalves

♟️

Data Tech Lead | Data Engineer | Software Engineer

60 followers · 138 following

Universidade de Brasília (UnB)
Brasília, Brasil
jefersonalves.com
in/ferreirajeferson

Achievements

Organizations

Stars

moj-analytical-services / splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 1,843 205 Updated Dec 25, 2025

langchain-ai / deepagents

Deepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …

Python 7,549 1,163 Updated Dec 23, 2025

littleblah / senior-engineer-checklist

Senior Engineer CheckList

HTML 545 37 Updated Sep 13, 2021

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,798 10,213 Updated Dec 27, 2025

Unstructured-IO / unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 13,501 1,115 Updated Dec 25, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 31,068 2,504 Updated Dec 26, 2025

chonkie-inc / chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

Python 3,410 217 Updated Dec 26, 2025

googleapis / genai-toolbox

MCP Toolbox for Databases is an open source MCP server for databases.

Go 11,966 1,074 Updated Dec 27, 2025

sparckles / Robyn

Robyn is a Super Fast Async Python Web Framework with a Rust runtime.

Python 6,830 312 Updated Dec 26, 2025

bartosz25 / data-engineering-design-patterns-book

Code snippets for Data Engineering Design Patterns book

Python 302 76 Updated Dec 16, 2025

scorzeth / anki-mcp-server

An MCP server for Anki

JavaScript 177 32 Updated Jan 8, 2025

langchain-ai / langchain-mcp-adapters

LangChain 🔌 MCP

Python 3,227 344 Updated Dec 15, 2025

TensorBlock / awesome-mcp-servers

A comprehensive collection of Model Context Protocol (MCP) servers

497 66 Updated Dec 9, 2025

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 19,776 1,940 Updated Dec 27, 2025

openai / openai-cookbook

Examples and guides for using the OpenAI API

Jupyter Notebook 69,996 11,764 Updated Dec 22, 2025

katanaml / sparrow

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python 5,077 509 Updated Dec 19, 2025

deepseek-ai / DeepSeek-R1

91,612 11,770 Updated Jun 27, 2025

Kaggle / kaggle-environments

Jupyter Notebook 360 161 Updated Dec 19, 2025

cgearhart / Chessnut

Python chess model

Python 76 19 Updated Nov 28, 2024

docling-project / docling

Get your documents ready for gen AI

Python 47,998 3,352 Updated Dec 24, 2025

observablehq / plot

A concise API for exploratory data visualization implementing a layered grammar of graphics

HTML 5,087 201 Updated Nov 14, 2025

observablehq / framework

A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data a…

TypeScript 3,297 179 Updated May 7, 2025

StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 11,152 2,240 Updated Dec 27, 2025

Netflix / maestro

Maestro: Netflix’s Workflow Orchestrator

Java 3,688 252 Updated Dec 4, 2025

unitycatalog / unitycatalog

Open, Multi-modal Catalog for Data & AI

Java 3,236 558 Updated Dec 18, 2025

LucaCanali / sparkMeasure

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 805 159 Updated Nov 6, 2025

palantir / pyspark-style-guide

This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.

Python 1,204 158 Updated Sep 8, 2025

alexott / spark-playground

Playing with different packages of the Apache Spark

Scala 30 13 Updated Dec 26, 2025

datacontract / datacontract-cli

Enforce Data Contracts

Python 782 190 Updated Dec 22, 2025

1ambda / lakehouse

Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)

Kotlin 64 15 Updated Sep 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jeferson Alves Jefersonalves

Achievements

Achievements

Organizations

Block or report Jefersonalves

Stars

moj-analytical-services / splink

langchain-ai / deepagents

littleblah / senior-engineer-checklist

google-gemini / gemini-cli

Unstructured-IO / unstructured

stanfordnlp / dspy

chonkie-inc / chonkie

googleapis / genai-toolbox

sparckles / Robyn

bartosz25 / data-engineering-design-patterns-book

scorzeth / anki-mcp-server

langchain-ai / langchain-mcp-adapters

TensorBlock / awesome-mcp-servers

langfuse / langfuse

openai / openai-cookbook

katanaml / sparrow

deepseek-ai / DeepSeek-R1

Kaggle / kaggle-environments

cgearhart / Chessnut

docling-project / docling

observablehq / plot

observablehq / framework

StarRocks / starrocks

Netflix / maestro

unitycatalog / unitycatalog

LucaCanali / sparkMeasure

palantir / pyspark-style-guide

alexott / spark-playground

datacontract / datacontract-cli

1ambda / lakehouse