Overview

Relevant source files

Purpose

This document provides a high-level introduction to the wavespectra library, explaining its architecture, core components, and capabilities. It covers the foundational design patterns, data model, and how the system processes ocean wave spectral data.

For detailed information on specific subsystems:

System architecture and design patterns: see System Architecture and Design Patterns
Installation procedures: see Installation and Setup
Command-line interface: see Command-Line Interface
Core data structures: see Core Data Model
Input/output operations: see Input System and Output System
Analysis capabilities: see Wave Analysis and Processing

What is Wavespectra

Wavespectra is a Python library for processing and analyzing ocean wave spectral data. It is built on top of xarray, extending xarray.DataArray and xarray.Dataset with domain-specific functionality for wave spectra through the .spec accessor namespace. The library provides:

Unified data model with standardized coordinates (freq, dir, time) and variables (efth, wspd, wdir, dpt)
Extensive I/O support for 15+ input formats and 7+ output formats
60+ analysis methods for calculating integrated wave parameters, transformations, and wave physics
Spectral partitioning using 7 algorithms including watershed-based methods
High performance through C extensions and dask integration for large datasets

Sources: README.rst1-321 docs/quickstart.rst1-369

Architectural Foundation

xarray Extension via Accessor Pattern

Wavespectra uses xarray's accessor registration pattern to extend core xarray objects without modifying them. Two primary accessor classes are registered under the .spec namespace:

Accessor Pattern Implementation

The SpecArray class extends xarray.DataArray and provides spectral analysis methods. The SpecDataset class extends xarray.Dataset and wraps SpecArray methods while adding dataset-level operations like I/O and spatial selection. Once wavespectra is imported, any xarray object following the required conventions automatically gains the .spec accessor.

Sources: docs/quickstart.rst47-76 docs/library.rst1-134 docs/conventions.rst1-72

Data Conventions

Wavespectra requires specific coordinate and variable naming conventions:

Type	Name	Units	Description	Required
Coordinate	`freq`	Hz	Wave frequency	Yes
Coordinate	`dir`	degrees	Wave direction (coming-from)	Yes (2D spectra)
Coordinate	`time`	datetime	Time coordinate	Optional
Variable	`efth`	m²/Hz/degree or m²/Hz	Wave energy density	Yes
Variable	`wspd`	m/s	Wind speed	Optional (partitioning)
Variable	`wdir`	degrees	Wind direction	Optional (partitioning)
Variable	`dpt`	m	Water depth	Optional (partitioning, physics)

These conventions are defined in wavespectra/core/attributes.py and enforced through the attrs module, which provides CF-compliant metadata from wavespectra/core/attributes.yml

Sources: docs/conventions.rst1-72 README.rst210-228

System Architecture Diagram

Architecture Overview

The system follows a layered architecture:

Input Layer: Dynamically loaded format-specific readers normalize diverse data sources into the unified data model. Backend entrypoints enable xr.open_dataset(file, engine='format') integration.
Core Data Model: xarray objects extended via SpecArray and SpecDataset accessors, providing the .spec namespace for all operations.
Analysis Engines: Modular computation engines for statistics, transformations, partitioning, construction, and physics calculations.
Output Layer: Format-specific writers that transform the unified data model back to external formats.
Utilities: Cross-cutting functionality including C-compiled watershed algorithm for performance-critical operations.

Sources: README.rst1-321 docs/api.rst1-309

Data Flow Through the System

Data Flow Steps

Read: Format-specific reader (e.g., read_swan(), read_ww3()) parses external format
Normalize: Convert to standard coordinates (freq, dir, time) and variables (efth)
Enhance: Apply CF-compliant metadata from wavespectra/core/attributes.yml
Access: User accesses .spec namespace for analysis methods
Analyze: Methods operate on normalized data structure
Write: Format-specific writer transforms back to external format (optional)

Sources: README.rst84-130 docs/quickstart.rst77-102

Core Capabilities

Input/Output System

The I/O system supports 15+ input formats and 7+ output formats through a plugin architecture:

Input Formats:

Wave models: SWAN, WAVEWATCH III, WWM, Funwave
Reanalysis: ERA5, ECMWF
Observations: NDBC (ASCII/NetCDF), AWAC (NMEA), Triaxys, Spotter, Octopus
Generic: NetCDF, JSON, XWaves

Output Formats:

Wave models: SWAN, WAVEWATCH III, Funwave, OrcaFlex
Generic: NetCDF, JSON, Octopus

Readers are dynamically loaded by wavespectra/input/__init__.py and exposed at module level. xarray backend entrypoints are defined in pyproject.toml for seamless xr.open_dataset() integration.

Sources: README.rst229-238 docs/api.rst113-178

Analysis Methods (60+)

The SpecArray accessor provides 60+ methods organized into categories:

Category	Methods	Examples
Wave Parameters	Integrated statistics	`hs()`, `tp()`, `fp()`, `tm01()`, `tm02()`, `dp()`, `dpm()`, `dm()`, `dspr()`
Transformations	Spectral manipulation	`oned()`, `split()`, `rotate()`, `interp()`, `smooth()`, `scale_by_hs()`
Moments	Spectral moments	`momf()`, `momd()`
Wave Physics	Physical properties	`celerity()`, `wavelen()`, `uss()` (Stokes drift)
Fitting	Parametric fitting	`fit_jonswap()`, `fit_gaussian()`

All methods leverage xarray's lazy evaluation and dask integration for efficient processing of large datasets.

Sources: docs/api.rst23-92 README.rst94-102

Spectral Partitioning System

The partitioning subsystem separates complex wave spectra into physically meaningful components (wind sea + swells). Accessed via .spec.partition, it provides 7 algorithms:

The watershed algorithm in wavespectra/partition/specpart.c provides 10-100x speedup for critical partitioning operations. Algorithms requiring meteorological data use the wave age criterion from wavespectra/core/utils.py to classify partitions as wind sea vs swell.

Sources: README.rst134-158 docs/api.rst56-72 docs/quickstart.rst258-297

Spectral Construction

The construction subsystem creates synthetic spectra from parametric forms:

Frequency Shapes (wavespectra/construct/frequency.py):

jonswap(): JONSWAP spectrum for developing seas
tma(): TMA spectrum for finite depth
gaussian(): Gaussian spectrum
pierson_moskowitz(): Pierson-Moskowitz spectrum

Directional Shapes (wavespectra/construct/direction.py):

cartwright(): Cartwright directional distribution
asymmetric(): Asymmetric directional distribution

Combined Construction:

construct_partition(): Combines frequency and directional shapes into 2D spectra
partition_and_reconstruct(): Partition existing spectra, fit parametric forms, and reconstruct

Sources: README.rst160-189 docs/api.rst180-194 docs/quickstart.rst299-325

Development Infrastructure

Build System

The package uses setuptools with a C extension for the watershed algorithm. Key build components:

setup.py: Defines C extension with NumPy include dependencies
pyproject.toml: Project metadata, dependencies, xarray backend entrypoints, CLI entrypoints
wavespectra/partition/specpart.c: C source for watershed algorithm
wavespectra/partition/specpart.h: C headers

The build system compiles the C extension during installation, requiring a C compiler and NumPy headers.

Sources: docs/install.rst1-139

Testing and CI/CD

Testing infrastructure includes:

Test Framework: pytest with tox for multi-environment testing
CI/CD: GitHub Actions testing matrix across Python 3.9-3.13 on Ubuntu and macOS
Sample Data: tests/sample_files/ directory with test fixtures
Linting: ruff for code quality

Sources: README.rst277-283

Plugin Architecture

The system uses three plugin mechanisms:

Dynamic Reader Loading: wavespectra/input/__init__.py scans input/*.py files and exposes read_* functions at module level
xarray Backend Entrypoints: Defined in pyproject.toml for xr.open_dataset(engine='format') integration
Output Plugin Metaclass: Plugin pattern for output methods registered to SpecDataset

This architecture allows users to extend the system with custom readers and writers without modifying core code.

Sources: docs/api.rst1-309

Usage Patterns

Basic Analysis Workflow

Advanced Partitioning Workflow

Output Workflow

Sources: README.rst84-130 docs/quickstart.rst1-369

Overview

Relevant source files

Purpose

For detailed information on specific subsystems:

System architecture and design patterns: see System Architecture and Design Patterns
Installation procedures: see Installation and Setup
Command-line interface: see Command-Line Interface
Core data structures: see Core Data Model
Input/output operations: see Input System and Output System
Analysis capabilities: see Wave Analysis and Processing

What is Wavespectra

Unified data model with standardized coordinates (freq, dir, time) and variables (efth, wspd, wdir, dpt)
Extensive I/O support for 15+ input formats and 7+ output formats
60+ analysis methods for calculating integrated wave parameters, transformations, and wave physics
Spectral partitioning using 7 algorithms including watershed-based methods
High performance through C extensions and dask integration for large datasets

Sources: README.rst1-321 docs/quickstart.rst1-369

Architectural Foundation

xarray Extension via Accessor Pattern

Wavespectra uses xarray's accessor registration pattern to extend core xarray objects without modifying them. Two primary accessor classes are registered under the .spec namespace:

Accessor Pattern Implementation

Sources: docs/quickstart.rst47-76 docs/library.rst1-134 docs/conventions.rst1-72

Data Conventions

Wavespectra requires specific coordinate and variable naming conventions:

Type	Name	Units	Description	Required
Coordinate	`freq`	Hz	Wave frequency	Yes
Coordinate	`dir`	degrees	Wave direction (coming-from)	Yes (2D spectra)
Coordinate	`time`	datetime	Time coordinate	Optional
Variable	`efth`	m²/Hz/degree or m²/Hz	Wave energy density	Yes
Variable	`wspd`	m/s	Wind speed	Optional (partitioning)
Variable	`wdir`	degrees	Wind direction	Optional (partitioning)
Variable	`dpt`	m	Water depth	Optional (partitioning, physics)

These conventions are defined in wavespectra/core/attributes.py and enforced through the attrs module, which provides CF-compliant metadata from wavespectra/core/attributes.yml

Sources: docs/conventions.rst1-72 README.rst210-228

System Architecture Diagram

Architecture Overview

The system follows a layered architecture:

Input Layer: Dynamically loaded format-specific readers normalize diverse data sources into the unified data model. Backend entrypoints enable xr.open_dataset(file, engine='format') integration.
Core Data Model: xarray objects extended via SpecArray and SpecDataset accessors, providing the .spec namespace for all operations.
Analysis Engines: Modular computation engines for statistics, transformations, partitioning, construction, and physics calculations.
Output Layer: Format-specific writers that transform the unified data model back to external formats.
Utilities: Cross-cutting functionality including C-compiled watershed algorithm for performance-critical operations.

Sources: README.rst1-321 docs/api.rst1-309

Data Flow Through the System

Data Flow Steps

Read: Format-specific reader (e.g., read_swan(), read_ww3()) parses external format
Normalize: Convert to standard coordinates (freq, dir, time) and variables (efth)
Enhance: Apply CF-compliant metadata from wavespectra/core/attributes.yml
Access: User accesses .spec namespace for analysis methods
Analyze: Methods operate on normalized data structure
Write: Format-specific writer transforms back to external format (optional)

Sources: README.rst84-130 docs/quickstart.rst77-102

Core Capabilities

Input/Output System

The I/O system supports 15+ input formats and 7+ output formats through a plugin architecture:

Input Formats:

Wave models: SWAN, WAVEWATCH III, WWM, Funwave
Reanalysis: ERA5, ECMWF
Observations: NDBC (ASCII/NetCDF), AWAC (NMEA), Triaxys, Spotter, Octopus
Generic: NetCDF, JSON, XWaves

Output Formats:

Wave models: SWAN, WAVEWATCH III, Funwave, OrcaFlex
Generic: NetCDF, JSON, Octopus

Readers are dynamically loaded by wavespectra/input/__init__.py and exposed at module level. xarray backend entrypoints are defined in pyproject.toml for seamless xr.open_dataset() integration.

Sources: README.rst229-238 docs/api.rst113-178

Analysis Methods (60+)

The SpecArray accessor provides 60+ methods organized into categories:

Category	Methods	Examples
Wave Parameters	Integrated statistics	`hs()`, `tp()`, `fp()`, `tm01()`, `tm02()`, `dp()`, `dpm()`, `dm()`, `dspr()`
Transformations	Spectral manipulation	`oned()`, `split()`, `rotate()`, `interp()`, `smooth()`, `scale_by_hs()`
Moments	Spectral moments	`momf()`, `momd()`
Wave Physics	Physical properties	`celerity()`, `wavelen()`, `uss()` (Stokes drift)
Fitting	Parametric fitting	`fit_jonswap()`, `fit_gaussian()`

All methods leverage xarray's lazy evaluation and dask integration for efficient processing of large datasets.

Sources: docs/api.rst23-92 README.rst94-102

Spectral Partitioning System

The partitioning subsystem separates complex wave spectra into physically meaningful components (wind sea + swells). Accessed via .spec.partition, it provides 7 algorithms:

Sources: README.rst134-158 docs/api.rst56-72 docs/quickstart.rst258-297

Spectral Construction

The construction subsystem creates synthetic spectra from parametric forms:

Frequency Shapes (wavespectra/construct/frequency.py):

jonswap(): JONSWAP spectrum for developing seas
tma(): TMA spectrum for finite depth
gaussian(): Gaussian spectrum
pierson_moskowitz(): Pierson-Moskowitz spectrum

Directional Shapes (wavespectra/construct/direction.py):

cartwright(): Cartwright directional distribution
asymmetric(): Asymmetric directional distribution

Combined Construction:

construct_partition(): Combines frequency and directional shapes into 2D spectra
partition_and_reconstruct(): Partition existing spectra, fit parametric forms, and reconstruct

Sources: README.rst160-189 docs/api.rst180-194 docs/quickstart.rst299-325

Development Infrastructure

Build System

The package uses setuptools with a C extension for the watershed algorithm. Key build components:

setup.py: Defines C extension with NumPy include dependencies
pyproject.toml: Project metadata, dependencies, xarray backend entrypoints, CLI entrypoints
wavespectra/partition/specpart.c: C source for watershed algorithm
wavespectra/partition/specpart.h: C headers

The build system compiles the C extension during installation, requiring a C compiler and NumPy headers.

Sources: docs/install.rst1-139

Testing and CI/CD

Testing infrastructure includes:

Test Framework: pytest with tox for multi-environment testing
CI/CD: GitHub Actions testing matrix across Python 3.9-3.13 on Ubuntu and macOS
Sample Data: tests/sample_files/ directory with test fixtures
Linting: ruff for code quality

Sources: README.rst277-283

Plugin Architecture

The system uses three plugin mechanisms:

Dynamic Reader Loading: wavespectra/input/__init__.py scans input/*.py files and exposes read_* functions at module level
xarray Backend Entrypoints: Defined in pyproject.toml for xr.open_dataset(engine='format') integration
Output Plugin Metaclass: Plugin pattern for output methods registered to SpecDataset

This architecture allows users to extend the system with custom readers and writers without modifying core code.

Overview

Purpose

What is Wavespectra

Architectural Foundation

xarray Extension via Accessor Pattern

Data Conventions

System Architecture Diagram

Data Flow Through the System

Core Capabilities

Input/Output System

Analysis Methods (60+)

Spectral Partitioning System

Spectral Construction

Development Infrastructure

Build System

Testing and CI/CD

Plugin Architecture

Usage Patterns

Basic Analysis Workflow

Advanced Partitioning Workflow

Output Workflow

On this page

Overview

Purpose

What is Wavespectra

Architectural Foundation

xarray Extension via Accessor Pattern

Data Conventions

System Architecture Diagram

Data Flow Through the System

Core Capabilities

Input/Output System

Analysis Methods (60+)

Spectral Partitioning System

Spectral Construction

Development Infrastructure

Build System

Testing and CI/CD

Plugin Architecture

Usage Patterns

Basic Analysis Workflow

Advanced Partitioning Workflow

Output Workflow

On this page