Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Neural-Nirvana/canana

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

12 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎨 Canana - Visual Prompting Canvas for Nano Banana AI

The Revolutionary Digital Canvas for Advanced Visual Prompting with Google's Nano Banana

Canana transforms how you interact with Google's state-of-the-art Nano Banana (Gemini 2.5 Flash Image Preview) through intuitive visual prompting. Draw, annotate, and guide AI with arrows, text, and canvas-based instructions for unprecedented precision and control.

Canana Interface AI Models License


✨ What is Visual Prompting with Nano Banana?

Visual prompting represents the cutting-edge evolution of AI interaction, moving beyond simple text descriptions to sophisticated canvas-based instruction systems. Canana enables you to:

  • Draw directly on canvas with arrows, circles, and annotations that Nano Banana understands
  • Combine visual cues with natural language for precise editing control
  • Use multi-turn conversational editing to refine images iteratively
  • Maintain character consistency across multiple transformations
  • Blend multiple images with contextual understanding

This approach leverages Google's Nano Banana (officially Gemini 2.5 Flash Image Preview), the world's top-rated image editing model that excels at understanding visual instructions, maintaining lighting consistency, and performing targeted transformations while preserving everything else intact.


🧠 Powered by Nano Banana: Google's SOTA Image AI

What Makes Nano Banana Revolutionary

Nano Banana isn't just another text-to-image toolβ€”it's a prompt-driven photo editor that provides Photoshop-like control through simple visual instructions:

  • Targeted Editing: Edit only what you specify while keeping everything else intactβ€”lighting, perspective, textures, even facial features
  • Character Consistency: Maintains subject appearance across multiple images for rich storytelling
  • Multi-Image Fusion: Understands and merges multiple input images with contextual awareness
  • Real-Time Performance: Often responds in 1-2 seconds, feeling like real-time editing
  • SynthID Watermarking: Built-in digital watermarking for AI-generated content

Technical Specifications

  • Model: gemini-2.5-flash-image-preview
  • Performance: Top-rated on LMArena benchmarks
  • Pricing: $0.039 per image (1290 output tokens)
  • Speed: 1-2 second response times
  • Availability: Gemini API, Google AI Studio, Vertex AI

πŸš€ Advanced Visual Prompting Features

🎯 Canvas-Based Instruction System

  • Arrow Annotations: Direct Nano Banana's attention to specific areas
  • Bounding Box Markup: Mark regions with rectangles, ellipses, and shapes
  • Textual Overlays: Add natural language instructions directly on canvas
  • Multi-Layer Annotations: Combine visual cues for complex edits
  • Instruction Exclusion: AI automatically removes instruction markup from final output

🎨 Professional Canvas Tools

  • Precision Drawing: Brushes, shapes, and freehand tools optimized for AI instruction
  • Smart Import System: Drag & drop images with automatic canvas integration
  • Multi-Workspace: Organize complex projects across multiple canvases
  • Zoom-Aware Export: Export only the current viewport area at any zoom level

βš™οΈ Nano Banana Optimization Engine

  • Prompt Templates: Pre-optimized prompts for different editing scenarios:
    • πŸ“Έ Photorealistic Enhancement: "A photorealistic [shot type] of [subject], [action], set in [environment]. The scene is illuminated by [lighting], creating a [mood] atmosphere"
    • 🎨 Style Transfer: Blend multiple images while maintaining composition
    • ✏️ Targeted Editing: Remove, add, or modify specific elements
    • 🌈 Creative Transformation: Artistic interpretation with consistent lighting
  • Multi-Turn Editing: Iterative refinement through conversational prompting
  • Context Preservation: Maintains scene coherence across multiple edits

πŸ”’ Privacy & Performance

  • Local Processing: All canvas data processed locally
  • Encrypted API Storage: Secure API key management
  • Optimized Requests: Smart batching and compression for Nano Banana API
  • Real-Time Feedback: Live processing status with stage-by-stage updates

🎭 Visual Prompting Workflows

1. Targeted Object Editing

Your Workflow:
1. Import or create base image on canvas
2. Draw arrow pointing to object to modify
3. Add text annotation: "Change this car to red sports car"
4. Nano Banana edits only the specified object
5. Download result with perfect lighting consistency

Perfect for: Product mockups, object replacement, color changes

2. Compositional Transformation

Your Workflow:
1. Upload reference image to canvas
2. Use bounding boxes to mark areas for change
3. Add directional arrows for lighting/perspective
4. Annotate with scene description changes
5. Generate transformed composition

Perfect for: Background replacement, scene enhancement, artistic interpretation

3. Character Consistency Workflow

Your Workflow:
1. Create character in first canvas
2. Use arrows to highlight key facial features
3. Annotate desired pose/expression changes
4. Generate variations maintaining character likeness
5. Build consistent character library

Perfect for: Storytelling, comic creation, brand mascots

4. Multi-Image Fusion

Your Workflow:
1. Import multiple source images
2. Use annotations to specify which elements to combine
3. Add arrows showing desired composition layout
4. Nano Banana intelligently blends with proper lighting
5. Refine through multi-turn editing

Perfect for: Collages, concept visualization, creative compositing

πŸ›  Quick Start with Nano Banana

Prerequisites

  • Node.js 16+ and npm
  • Google Gemini API key (Get one here)
  • Modern browser with canvas support

Installation & Setup

# Clone the repository
git clone https://github.com/Neural-Nirvana/canana.git
cd canana

# Install dependencies
npm install

# Start development server
npm start

# Open http://localhost:3000 in your browser

First Visual Prompting Session

  1. Click the ✨ AI Magic button (rainbow gradient in dock)
  2. Enter your Google Gemini API key in secure panel
  3. Draw or import an image on canvas
  4. Add arrows and text annotations for editing instructions
  5. Click generate and watch Nano Banana work its magic!

πŸ’‘ Pro Tips for Nano Banana Visual Prompting

Arrow Annotation Best Practices

βœ… DO: Point arrows at specific objects or areas to edit
βœ… DO: Use arrows to indicate lighting direction changes
βœ… DO: Combine arrows with descriptive text for clarity
❌ DON'T: Overuse arrows - Nano Banana understands context

Prompt Engineering for Visual Instructions

βœ… DO: Write descriptive paragraphs, not keyword lists
βœ… DO: Be specific about edits: "make background a neon diner at night"
βœ… DO: Use multi-turn editing - change one thing at a time
❌ DON'T: Try to change everything in a single prompt

Canvas Composition Guidelines

βœ… DO: Use contrasting colors for instruction markup
βœ… DO: Leave space around objects for context understanding
βœ… DO: Group related annotations logically
❌ DON'T: Overcrowd the canvas with competing instructions

Lighting and Consistency Tips

βœ… DO: Trust Nano Banana's lighting consistency algorithms
βœ… DO: Use arrows to indicate desired light source direction
βœ… DO: Annotate material properties for realistic rendering
❌ DON'T: Over-specify technical details - describe naturally

🌟 Advanced Nano Banana Capabilities

Real-Time Processing Feedback

Canana provides live visual feedback during Nano Banana processing:

  • 🎨 Preparing: Canvas analysis and instruction parsing
  • πŸ“€ Uploading: Secure API communication with compression
  • 🧠 Processing: Nano Banana model inference
  • ✨ Generating: Image synthesis and editing
  • πŸŽ‰ Complete: Final result with instruction markup removed

Smart Output Processing

Nano Banana + Canana automatically optimizes results:

  • Instruction Removal: Automatically excludes arrows, text, and UI elements
  • Intelligent Cropping: Crops to natural content boundaries
  • Lighting Harmonization: Ensures consistent lighting across all elements
  • Perspective Correction: Maintains proper spatial relationships

Multi-Workspace Project Management

  • Canvas Collections: Organize related visual prompting sessions
  • Auto-Save: All work automatically preserved with thumbnails
  • Export Options: Download individual results or project collections
  • Version History: Track iterative improvements through visual prompting

🎯 Professional Use Cases

Creative Professionals

  • Concept Artists: Rapid ideation with consistent character development
  • UI/UX Designers: Interface mockup enhancement and A/B testing
  • Product Designers: Photorealistic product visualization with lighting control
  • Photographers: Advanced photo editing with natural language instructions

Content Creators & Marketers

  • Social Media: Eye-catching visuals with brand consistency
  • Advertising: Campaign assets with targeted messaging
  • E-commerce: Product photography enhancement and variant generation
  • Brand Development: Logo iterations and visual identity exploration

Developers & Teams

  • Prototyping: Visual mockups with annotation-driven refinement
  • Documentation: Technical diagrams with professional enhancement
  • Presentations: Slide graphics with consistent visual themes
  • User Research: Visual concepts for user testing and feedback

πŸ”§ Technical Architecture

Frontend Stack

  • React 18: Modern UI with hooks and concurrent rendering
  • Fabric.js: Professional canvas manipulation and event handling
  • CSS3: Glass morphism effects and smooth animations

Nano Banana Integration

  • Google GenAI SDK: Official Gemini 2.5 Flash Image Preview API
  • Streaming Processing: Real-time generation status updates
  • Intelligent Compression: Optimized base64 encoding for API efficiency
  • Error Handling: Robust retry logic and graceful degradation

Performance Optimization

  • Local Canvas Processing: Client-side image manipulation
  • Smart Caching: Efficient workspace and result storage
  • Lazy Loading: Performance-optimized rendering for large projects
  • Memory Management: Automatic cleanup for long editing sessions

🀝 Contributing to Canana

Help us advance visual prompting technology for Nano Banana and future AI models!

Development Priorities

  • Enhanced Annotation Tools: More sophisticated visual instruction methods
  • Batch Processing: Multiple canvas processing with Nano Banana
  • Export Formats: Additional output options for professional workflows
  • Accessibility: Improved tools for users with disabilities

Setup for Contributors

# Fork and clone the repository
git clone https://github.com/Neural-Nirvana/canana.git

# Create feature branch
git checkout -b feature/visual-prompting-enhancement

# Install development dependencies
npm install

# Run development server with hot reload
npm start

# Run tests and linting
npm test && npm run lint

πŸ“Š Nano Banana Model Information

Model Capabilities

  • Visual Understanding: Interprets complex visual instructions and annotations
  • Instruction Following: Processes arrows, shapes, and textual overlays
  • Context Preservation: Maintains scene consistency across edits
  • Multi-Modal Input: Combines text, images, and visual annotations
  • Real-Time Performance: Industry-leading response times for interactive editing

Supported Visual Prompt Types

  • Rectangle and ellipse bounding boxes
  • Directional arrows and pointing indicators
  • Freehand scribbles and masks created with BΓ©zier curves
  • Text annotations and natural language overlays
  • Multi-image composition instructions

Quality Assurance

  • Top-Rated Performance: #1 on LMArena benchmarks
  • Consistent Output: Reliable lighting, shadows, and color temperature
  • Professional Quality: Suitable for commercial and creative applications
  • SynthID Integration: Built-in AI content identification

πŸ“„ License & Legal

MIT License - see LICENSE file for full terms.

Note: Usage of Google's Nano Banana model is subject to Google's AI terms of service. Canana is an independent interface tool and is not officially affiliated with Google.


πŸ™ Acknowledgments

  • Google DeepMind: For the revolutionary Nano Banana model
  • Visual Prompting Research Community: For advancing annotation-based AI interaction
  • Fabric.js Team: For the robust canvas manipulation library
  • React Community: For the powerful frontend framework
  • Open Source Contributors: For continuous improvement and feedback

πŸ“ž Support & Resources


Built for the Future of Visual AI Interaction

Canana - Where Visual Prompting Meets Nano Banana's Intelligence

Transform your creative workflow with the world's most advanced visual prompting canvas, optimized for Google's state-of-the-art Nano Banana AI model.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published