Thanks to visit codestin.com
Credit goes to github.com

Skip to content

RAPTOR (Rapid AI-Powered Text and Object Recognition) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis, semantic search, and actionable insights. RAPTOR reducing manual tagging by 85% and making content discovery 10x faster.

License

Notifications You must be signed in to change notification settings

DHT-AI-Studio/RAPTOR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

17 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

RAPTOR AI Framework

RAPTOR Logo

Rapid AI-Powered Text and Object Recognition
Open-Source Content Insight Engine for Enterprise AI Applications

License Version Status Python GitHub stars GitHub forks

About β€’ Features β€’ Installation β€’ Quick Start β€’ Documentation β€’ Contributing β€’ Website


RAPTOR is an advanced AI framework developed by the DHT Taiwan Team at DHT Solutions.

πŸš€ Current Release

Aigle 0.1 - First Community Beta Release

This is the initial open-source release of the RAPTOR framework, codenamed "Aigle". We're excited to share this with the community and look forward to your feedback and contributions.

πŸ§ͺ Aigle 0.1 - Evaluation and Testing API

To help developers get started with the RAPTOR framework quickly and easily, we've deployed a test run API on DHT's development infrastructure. This evaluation API allows developers to:

  • Test and evaluate RAPTOR capabilities without setting up infrastructure
  • Develop AI applications using the RAPTOR framework with zero deployment overhead
  • Utilize DHT resources for testing and development purposes
  • Prototype faster by accessing pre-configured AI services

This is an excellent way to explore RAPTOR's features, build proof-of-concepts, and validate your use cases before deploying your own infrastructure.

πŸ”— Access the Evaluation API:
http://raptor_open_0_1_api.dhtsolution.com:8012/

For detailed API documentation, usage examples, and access instructions, please visit the link above.

Note: This is a development environment intended for evaluation and testing purposes. For production deployments, please refer to the Installation and Development sections below.

πŸ“‹ Table of Contents

🎯 About RAPTOR

RAPTOR (Rapid AI-Powered Text and Object Recognition) is a Content Insight Engine that represents a paradigm shift in digital asset management, transforming passive media storage into an intelligent knowledge platform. By leveraging cutting-edge AI technologies including large language models, vector search, and semantic understanding, RAPTOR enables organizations to unlock the full value of their media assets through automated analysis, intelligent search, and actionable insights.

Business Value Proposition

  • 85% reduction in manual content tagging and metadata generation
  • 10x faster content discovery through semantic search
  • 60% improvement in content reuse and operational efficiency
  • Real-time insights from video, audio, and document content
  • Enterprise-grade security, scalability, and integration capabilities

Strategic Differentiators

  1. AI-Native Architecture: Built from the ground up around LLM orchestration and vector search
  2. Multi-Modal Understanding: Unified analysis across video, audio, image, and text
  3. Semantic Intelligence: Context-aware search that understands intent, not just keywords
  4. Open + Enterprise Model: Open-source core with premium enterprise features
  5. Production-Ready: Kubernetes-native with auto-scaling, fault tolerance, and 99.9% uptime

✨ Features

Version Aigle 0.1

This first community release includes:

Core Capabilities

  • Multi-Modal Content Analysis: Process and understand video, audio, images, and text
  • Semantic Search Engine: Context-aware search using vector embeddings
  • AI-Powered Metadata Generation: Automated tagging and content classification
  • LLM Orchestration: Flexible integration with multiple language models
  • Vector Database Integration: High-performance similarity search and retrieval

Intelligence Features

  • Content Understanding: Extract insights from unstructured media
  • Entity Recognition: Identify people, places, objects, and concepts
  • Sentiment Analysis: Understand emotional context in content
  • Topic Modeling: Automatic categorization and clustering
  • Temporal Analysis: Track content evolution over time

Enterprise Ready

  • Scalable Architecture: Kubernetes-native deployment
  • API-First Design: RESTful APIs for seamless integration
  • Security: Enterprise-grade authentication and authorization
  • Monitoring: Built-in observability and logging
  • Extensible: Plugin architecture for custom processors

For detailed release notes, see CHANGELOG.md.

πŸ“¦ Installation

# Clone the repository
git clone https://github.com/DHT-AI-Studio/RAPTOR.git
cd RAPTOR/Aigle/0.1

# Create virtual environment (recommended)
conda create -n CIE python=3.10
conda activate CIE

# Install required dependencies:
pip install -r requirements.txt

Development

Step 1.

cd raptor
chmod +x check-services.sh deploy.sh logs.sh rollback.sh stop-all.sh tag-backup.sh
./deploy.sh

Step 2.Deploy service

  1. Check container status

    ./check-services.sh
  2. Test API connectivity

    # Modellifecycle ζœε‹™
    curl -s http://192.168.157.165:8086/docs
    
    # Assetmanagement
    curl -s http://192.168.157.165:8010/docs
  3. View service logs

    ./logs.sh <service_name>

Quick Start

  1. Create a new user and assign a new branch to the user

    curl -X 'POST' \
    'http://192.168.157.165:8086/users' \
    -H 'accept: application/json' \
    -H 'Content-Type: application/json' \
    -d '{
    "username": "user1",
    "password": "dht888888",
    "password_hash": "",
    "branch": "",
    "permissions": [
        "upload",
        "download",
        "list"
    ]
    }'
  2. Create a new access token for the user

    curl -X 'POST' \
    'http://192.168.157.165:8086/token' \
    -H 'accept: application/json' \
    -H 'Content-Type: application/x-www-form-urlencoded' \
    -d 'grant_type=password&username=user1&password=dht888888&scope=&client_id=string&client_secret=********'
  3. Access RedisInsight πŸ”— http://192.168.157.165:5540

    Add a new connection:

    • Connection Type: Redis Cluster
    • Host: redis1
    • Port: 7000
    • Name: Redis Cluster
    • Authentication: dht888888
  4. Check if the local Ollama model includes qwen2.5:7b

    ollama list

    If not present

    ollama pull qwen2.5:7b
  5. Register MLflow with local Ollama

    curl -X 'POST' \
      'http://192.168.157.165:8010/models/register_ollama' \
      -H 'accept: application/json' \
      -H 'Content-Type: application/json' \
      -d '{
      "local_model_name": "qwen2.5:7b",
      "model_params": 7,
      "registered_name": "qwenforsummary",
      "set_priority_to_one": false,
      "stage": "production",
      "task": "text-generation-ollama",
      "version_description": "Register qwen2.5:7b local model"
    }'
  6. Check if the registration was successful

    curl -X 'GET' \
      'http://192.168.157.165:8010/models/registered_in_mlflow?show_all=false' \
      -H 'accept: application/json'
  7. Start audio/video/image/document service
    9.1 Create Kafka topics:

    cd path/to/kafka
    chmod +x create_topic.sh
    sudo ./create_topic.sh

    9.2 Starting Services

    cd services
    chmod +x start_services.sh
    ./start_services.sh

    9.3 Check if all services are still running

    ./check_services.sh
  8. Produce a test requests topic

    cd /path/to/kafka/test_service
    python test.py
  9. View Service Logs

    cd path/to/kafka
    tail -f service_name.log

    Available service names include: document_orchestrator_service, document_analysis_service, document_summary_service, document_save2qdrant_service, etc. You can replace document with audio, video, or image based on the required service, for example: audio_orchestrator_service, video_analysis_service, image_summary_service, etc

  10. Check Redis Data

    sudo docker exec -it redis-kafka_dev redis-cli --raw
    GET "document_orchestrator:correlation_id"
    GET "video_orchestrator:correlation_id"
    GET "audio_orchestrator:correlation_id"
    GET "image_orchestrator:correlation_id"

    Note: Replace service_name and correlation_id with actual values.

  11. Use the Qdrant Search API to query data
    (8821/video_search, 8822/audio_search, 8823/document_search, 8824/image_search)

    curl -X POST "http://192.168.157.165:8822/audio_search" \
      -H "Content-Type: application/json" \
      -d '{
        "query_text": "OpenAI",
        "embedding_type": "text",
        "limit": 5
      }'

πŸ“š Documentation

Available Documentation

Additional Resources

  • GitHub Wiki (Coming soon) - Tutorials, guides, and best practices
  • API Reference (Coming soon) - Complete API documentation
  • Video Tutorials (Coming soon) - Step-by-step video guides
  • Examples Repository (Coming soon) - Sample projects and use cases

πŸ› οΈ Built With

RAPTOR leverages cutting-edge technologies:

AI & Machine Learning:

  • πŸ€– Large Language Models (LLM) - Multi-provider support
  • 🧠 LangChain - LLM orchestration framework
  • πŸ” Qdrant - High-performance vector database
  • πŸ“Š MLflow - ML lifecycle management
  • 🎯 Sentence Transformers - Text embeddings

Backend & Infrastructure:

  • ⚑ FastAPI - Modern Python web framework
  • 🐍 Python 3.8+ - Core programming language
  • 🐳 Docker & Docker Compose - Containerization
  • ☸️ Kubernetes - Container orchestration (v1.0+)
  • πŸ“¨ Apache Kafka - Event streaming platform
  • πŸ’Ύ Redis Cluster - High-performance caching

Processing & Analysis:

  • πŸŽ₯ FFmpeg - Video/audio processing
  • πŸ”Š Whisper - Speech recognition
  • πŸ–ΌοΈ OpenCV - Computer vision
  • πŸ“„ PyPDF2, python-docx - Document processing
  • 🎡 Librosa - Audio analysis

Observability:

  • πŸ“Š Prometheus - Metrics collection
  • πŸ“ˆ Grafana - Metrics visualization
  • πŸ” ELK Stack - Logging (roadmap)

🌐 Community & Support

GitHub Issues GitHub Pull Requests Contributors Last Commit

We value your feedback and encourage community participation!

πŸ› Reporting Issues & Feature Requests

Please use GitHub Issues to:

  • πŸ› Report bugs
  • ✨ Request new features
  • ❓ Ask questions
  • πŸ’‘ Share suggestions

Before opening an issue:

  • Check existing issues to avoid duplicates
  • Use issue templates when available
  • Provide detailed information and steps to reproduce

πŸ“± Stay Connected

Join our community on multiple platforms:

Telegram Instagram X Website

Follow us for updates, announcements, and community discussions!

Coming soon: Discord server, LinkedIn group, and monthly community calls!

πŸ’¬ Get Help

We'll post updates, respond to questions, and collaborate with users across these platforms!

🀝 Contributing

We welcome contributions from the community! Please read our CONTRIBUTING.md guide to get started.

How to Contribute

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

Please read our Code of Conduct before contributing.

πŸ“„ License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Copyright 2025 DHT Taiwan Team

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

πŸ™ Acknowledgments

RAPTOR is developed and maintained by the DHT Taiwan Team.

About DHT Solutions

DHT Solutions is a technology company specializing in AI and software development solutions. Learn more at https://dhtsolution.com/.

πŸ‘₯ Development Team

Meet the talented developers behind RAPTOR:

titanh
titanh
Cing-dht
Cing-dht
fungdht
fungdht
GeorgeDHT
GeorgeDHT
NelsonYou1026
NelsonYou1026
tianyu0223
tianyu0223
Robertdht
Robertdht
QuinnChueh
QuinnChueh
Matthew20040407
Matthew20040407
minnie-dhtsolution
minnie-dhtsolution
lunar8386
lunar8386
Joe-DHT
Joe-DHT
benjamin-dhtsolution
benjamin-dhtsolution

πŸ—ΊοΈ Future Development Roadmap

The following features are planned for upcoming releases to transform RAPTOR into a production-ready, enterprise-grade platform:

1. API Gateway & Integration (v0.2 - Q4 2025)

  • API Gateway: Production-ready gateway with rate limiting and API documentation portal
  • Webhook System: Event-driven integration capability with webhook event system

2. Advanced Video Understanding (v0.3 - Q4 2025)

  • Implement temporal reasoning models for event sequences
  • Add action recognition and activity detection
  • Build scene relationship graphs
  • Create timeline-based navigation interface
  • Note: Docker Compose deployment won't scale to production needs

3. AI LLM Interface - MCP Integration (v0.3 - Q4 2025)

Implement Model Context Protocol (MCP) interfaces for core services:

  • Document Processing
  • Video Analysis
  • Audio Processing
  • Image Analysis
  • Semantic Search
  • Vector Database queries
  • Model Management
  • MCP Prompts

4. Custom Model Training Pipeline (v0.4 - Q1 2026)

  • Build fine-tuning workflows for domain-specific models
  • Optimized Training: Automated pipelines with DeepSpeed optimization, model evaluation framework, customer data isolation
  • Federated Learning: Privacy-preserving distributed training where data stays local, only model updates transferred

5. Admin Dashboard (v0.4 - Q1 2026)

  • Integrate Prometheus/Grafana metrics for real-time monitoring console
  • Create user management UI with role-based access control
  • Permission management and audit logging

6. OAuth2/OpenID Connect Integration (v0.4 - Q1 2026)

  • Migrate from basic JWT to OAuth2/OIDC standards
  • Enterprise SSO support (Azure AD, Google Workspace, Okta)
  • Multi-factor authentication

7. Content Moderation & Compliance (v0.5 - Q1 2026)

  • Train content moderation models (NSFW, violence, hate speech)
  • Implement automated flagging system
  • Build GDPR/CCPA compliance workflows
  • Create comprehensive audit reporting

8. Kubernetes Production Deployment (v1.0 - March 2026)

  • Production-ready Kubernetes deployment with Helm charts
  • Automated horizontal and vertical scaling
  • Service mesh integration for resilience
  • Multi-environment support (dev/staging/prod)
  • Critical: Docker Compose is not suitable for production scale

9. Centralized Logging & Observability (v1.0 - March 2026)

  • Deploy ELK Stack (Elasticsearch, Logstash, Kibana)
  • Configure log shipping from all 25+ services
  • Centralized log aggregation and retention
  • Advanced log search and analytics
  • Distributed tracing integration

Release Timeline

Version Target Focus Key Features
v0.2 Q4 2025 API & Integration API Gateway, Webhooks
v0.3 Q4 2025 AI Enhancement Video Understanding, MCP Interface
v0.4 Q1 2026 Enterprise Features Training Pipeline, Admin Dashboard, OAuth2
v0.5 Q1 2026 Compliance Content Moderation, GDPR/CCPA
v1.0 March 2026 Production Ready Kubernetes, ELK Stack, 99.9% SLA

Current Status: Aigle 0.1 (Community Beta) - October 2025 βœ…
Next Milestone: v0.2 with API Gateway (Q4 2025)
Production Target: v1.0 by March 2026


Made with ❀️ by the DHT Taiwan Team

For business inquiries: https://dhtsolution.com/

About

RAPTOR (Rapid AI-Powered Text and Object Recognition) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis, semantic search, and actionable insights. RAPTOR reducing manual tagging by 85% and making content discovery 10x faster.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published