Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

Conversation

@seandavi
Copy link
Owner

Summary

This PR adds a comprehensive new vignette that demonstrates how to explore and query the OmicIDX DuckDB database using the Omicidx R6 class.

Changes

  • New Vignette: exploring-omicidx-database.qmd - A detailed guide with extensive query examples
  • Documentation Update: Modified _pkgdown.yml to organize vignettes into logical sections

Vignette Contents

The new vignette provides a structured exploration of the database with sections covering:

Basic Operations

  • Initialize the Omicidx R6 client
  • Discover available tables
  • Get database schema overview

Entity-Specific Exploration (H3 sections)

  • SRA Studies: Study types, top centers, publication status
  • SRA Experiments: Platform distribution, instrument models, library strategies
  • SRA Samples: Organism diversity, taxonomic distribution
  • SRA Runs: Data volume statistics, size distribution, read length patterns

Advanced Analysis

  • Cross-Table Queries: Linking studies to data volume, platform usage patterns
  • Complex Queries: RNA-Seq studies, large-scale projects, multi-omics studies
  • Custom SQL: Direct SQL query examples

Benefits

  • Provides users with practical examples of database queries
  • Demonstrates both dplyr and SQL interfaces
  • Shows realistic use cases for genomics research
  • Helps users understand database structure and relationships
  • Organized with clear H3 headers for easy navigation

Testing

The vignette uses standard dplyr and DBI operations that work with the existing Omicidx R6 class. All queries follow best practices for lazy evaluation and efficient data retrieval.

Documentation

The vignette is properly integrated into pkgdown documentation with a dedicated "Data Exploration" section.

- Create new vignette 'exploring-omicidx-database.qmd' with detailed examples
- Demonstrate R6 Omicidx class usage across multiple query types
- Include exploration of studies, experiments, samples, and runs tables
- Add cross-table queries and advanced analysis examples
- Update _pkgdown.yml to include new vignette in documentation
- Organize vignettes into 'Getting Started' and 'Data Exploration' sections
- Correct column names (instrument_model, pubmed_ids)
- Remove queries for non-existent columns (total_bases, total_spots)
- Fix dbplyr compatibility issues (remove paste() with collapse)
- Simplify cross-table joins to avoid suffix issues
- Focus on metadata exploration rather than data volume statistics
- All queries now successfully execute against the actual database
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants