Stars
Privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go.
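"Hung-yi Lee Deep Learning Tutorial" (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
"A White-Box Guide to Building Large Models": Tiny-Universe, built entirely by hand
"A User's Guide to Open Source Large Models": a tutorial tailored for users in China on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work
Wenda (闻达): an LLM invocation platform. It targets efficient content generation for specific environments, while accounting for the limited computing resources of individuals and small and medium-sized enterprises, as well as knowledge security and privacy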
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents
DataCap is integrated software for data transformation, integration, and visualization. It supports a variety of data sources, file types, big data databases, relational databases, NoSQL database…
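mall-tiny is a rapid-development scaffold based on SpringBoot + MyBatis-Plus. It includes complete permission management, can connect to a Vue front end, and works out of the box.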
An easy-to-use, self-service, open BI reporting and BI dashboard platform.
Apache Superset is a Data Visualization and Data Exploration Platform
Supports agile DataOps based on Flink, DataX, Flink-CDC, and Chunjun, with a web UI
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Compare tables within or across databases
A side-by-side Chinese-English translation of "Software Engineering at Google"
Apache DolphinScheduler is a modern data orchestration platform for agile, low-code creation of high-performance workflows
DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…
Flink CDC is a streaming data integration tool
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Focused on big data learning and interview preparation; start your path to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...