Abstract
Understanding marine surface fish diversity is crucial for ecosystem management. However, the traditional sampling methods are often invasive, costly, or unsuitable for certain species or locations. Environmental DNA (eDNA) metabarcoding provides a non-invasive and relatively cheap alternative to explore patterns of diversity. It is important to recognize that, eDNA-based inference can vary across sampling methods, potentially impacting the validity of biodiversity assessments. To evaluate and compare the effectiveness of three eDNA sampling methods—ship-bottom intake (4.5 m), Niskin bottles (5 or 10 m), and bucket (0 m)—for assessing fish diversity and fish community composition in the western North Pacific near Japan, we analyzed fish communities from 83 stations sampled during nine research cruises. Taxonomic analysis revealed that each method detected over 324 taxa, contributing to a total of 465 taxa. Hierarchical clustering generally identified similar species composition across methods at a station. The exception was when intake samples, collected at different times, diverged from bucket and Niskin samples at the same station. Hill’s number rarefaction and extrapolation curves across all clusters showed similar results among methods, with exceptions in a few clusters where bucket samples exhibited higher biodiversity indices than intake and Niskin samples. Non-metric multidimensional scaling indicated significant relationships between cluster composition and environmental factors like temperature, salinity, and chlorophyll-a. Some clusters were also controlled by integrated seasonal factors. Overall, fish community composition was convincingly similar among methods. This finding suggests that any of these eDNA sampling methods can be effective and may be prioritized based on logistical considerations.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Understanding patterns of marine fish biodiversity is crucial as species richness considerably influences ecosystem functions, such as nutrient cycling and productivity (Thomsen et al. 2012; Gamfeldt et al. 2015). Assessing taxonomic diversity is vital for developing effective conservation and fishery management strategies (Różański et al. 2022), especially in areas with complex oceanic fronts (Saito 2019) and eddies (Xing et al. 2023), which act as biogeochemical hotspots by altering nutrient distributions and primary production (Woodson and Litvin 2015). These dynamics promote high productivity and growth for marine fish and their larvae (Bakun 2006).
Pelagic fish communities in ocean front ecosystems can serve as indicators of biodiversity hotspots (Miller and Christodoulou 2014) and play crucial roles in ecosystem functioning (Watari et al. 2019).
However, monitoring of marine surface biodiversity in pelagic environments remains limited (Martin et al. 2020), and a significant portion of pelagic fish species’ diversity and community composition is unknown (Higgs and Attrill 2015). This gap in sampling effort is largely due to the challenges associated with the diversity in pelagic fish size, behavior (Li et al. 2022), and distribution, which are often linked to environmental parameters (Yu et al. 2023). Marine pollution (Montenegro et al. 2020) and global climate change, which shift the distribution of marine life (Poloczanska et al. 2013), further complicate this issue. Additionally, the negative impacts of human activities on marine biodiversity, such as overfishing and high by-catch rates, have been well-documented (Jackson et al. 2001; Sandoval Gallardo et al. 2021), underscoring the need for ongoing biodiversity monitoring and community observations (Miloslavich et al. 2018).
The traditional sampling methods, such as trawling, or netting, face additional challenges when applied to marine surface fish communities. They are often ineffective at capturing rare species (Schmelzle and Kinziger 2016) thus limiting the taxonomic resolution of entire communities (Jac et al. 2021). Moreover, these traditional methods can be labor-intensive, costly, and invasive (Evans et al. 2017; Zhang et al. 2020), contributing to a significant knowledge gap about the biodiversity of many marine ecosystems (Claudet and Fraschetti 2010). Consequently, large-scale, non-invasive monitoring technologies are urgently needed to assess the biodiversity and community composition of poorly understood marine surface communities (Kennedy et al. 2020).
Environmental DNA (eDNA) has emerged as a valuable tool for ecosystem monitoring, providing a rapid, non-invasive, and cost-efficient approach to biodiversity assessment (He et al. 2023; Gold et al. 2021). eDNA enhances species detection by over 30% compared to traditional sampling methods and offers insights into fine-scale spatial and temporal variations within marine surface communities (Stat et al. 2019). As a comprehensive method, eDNA sampling holds promise for revolutionizing how marine biodiversity is monitored and understood (Afzali et al. 2021; Miya 2022). However, eDNA metabarcoding success depends on factors such as primer selection (Schenekar et al. 2020), methodology (Sakata et al. 2021), choice of bioinformatic pipeline (Brandt et al 2021), and contamination control (Andruszkiewicz et al. 2017; Fujii et al. 2019). The MiFish primer is widely used for fish species and has been shown to outperform other primers (Collins et al. 2018; Zhang et al. 2019), amplifying and identifying fish species’ eDNA by targeting a specific mitochondrial DNA region (Miya et al. 2015). However, the open-ocean application of this primer requires careful consideration of factors such as sampling effort (Kawakami et al. 2023) and the prevailing environmental context (Yu et al. 2023). Additionally, due to the relatively low quantity of DNA quantities found in ocean surface waters, sampling methods need to be optimized to increase the likelihood of the MiFish primer being effective (Yu et al. 2022).
Like all biodiversity sampling techniques, eDNA metabarcoding is subject to various biases and limitations (Pompanon et al. 2012; Fonseca 2018; Jo et al. 2019). In marine environments, the standard eDNA metabarcoding approach typically involves filtered water replicates collected from the water column (Stat et al. 2017). It has been argued that employing multiple sampling methods concurrently is needed to generate the most comprehensive assessment of biodiversity (Alexander et al. 2023). Therefore, MiFish metabarcoding studies usually sample the water column with Niskin bottles—typically deployed at depths starting from 10 m (Yu et al. 2022; Alexander et al. 2023); bucket samples collected from the sea surface (Miyata et al. 2022); and ship-bottom intake samples drawn from the vessel’s laboratory faucet, located several meters below the surface (Kawakami et al. 2023). Each method has advantages and disadvantages: the intake method allows for continuous sampling along the ship tracks, while the Niskin bottle and bucket methods require the vessel to stop (Yu et al. 2022). It is relatively easy to keep the Niskin bottles and buckets clean, whereas maintaining a clean environment with the intake method is more challenging. Additionally, the intake method may face risks from metal ion obstructions (Greco et al. 2022; Dumoulin et al. 2024). That said, all methods are vulnerable to the impacts of calcium ions that—being alkaline earth metals—can interfere with eDNA detection (Dumoulin et al. 2024).
The ocean’s surface layers are relatively well mixed by wind and waves and so eDNA is expected to be homogeneously distributed within this layer if there are no biases in the vertical distribution of fish (an example is Littlefair et al. 2020, although they focused on a lake). All bucket, intake, and 10 m Niskin bottle samples are expected to be within the mixed layer, and thus sampling the same eDNA field. In our previous research (Ahmed et al., 2025, submitted), we investigated species-specific fish detection in the western North Pacific—a region marked by dynamic oceanic fronts where the warm Kuroshio and cold Oyashio currents meet, creating a wide temperature range between the Kuroshio Extension and Oyashio Front (Yasuda 2003; Kida et al. 2015). We found that the bucket, intake, and Niskin sampling methods showed comparable detection performance for frequently detected species, as indicated by relative read abundance (RRA). However, some species such as the Japanese jack mackerel (Trachurus japonicus) and Pacific saury (Cololabis saira) showed better RRA in bucket samples. Nonetheless, the overall observed similarity in detection performance across methods suggests that combining data generated from different water collection methods on different occasions may be a viable strategy to generate larger spatio-temporal scale analyses of marine surface biodiversity. Still, it remains to be formally tested whether the bucket, intake, and Niskin methods actually capture similar types of biodiversity and fish community composition.
A meaningful comparison of eDNA methods requires each protocol to be deployed concurrently to sample the same parcel of water. However, even for such a presumably exact comparison, the sparse distribution of eDNA in the ocean may mean that sample-to-sample differences eventuate by chance (Kawakami et al. 2023). Here, we reduce the influence of single sample anomalies by comparing a large dataset of concurrently collected water samples across three methods compiled from multiple cruises under varying conditions. This approach enabled us to investigate whether using different common surface water sampling methods affect the detection of pelagic fish species and community composition within the well-mixed surface layer, where environmental conditions are generally uniform at a given time point.
2 Materials and methods
2.1 Study sites
The samples were collected from diverse habitats across the western North Pacific, including the Kuroshio (subtropical) region, Kuroshio Extension, the mixed water region (transition zone between the Kuroshio Extension and Oyashio Front), the Tsugaru Warm Water Current region, and the Oyashio (subarctic) area (Fig. 1). In addition, a single sample station was located in the Sea of Japan. Sampling was conducted at various stations, yielding a total of 83 sample sets during eight research cruises aboard the R/V Shinsei Maru and one cruise on the R/V Hakuho Maru. Of these, 52 sample sets were obtained from Ahmed et al. (2025, submitted), while 31 additional sets from the KS-22-15 and KH-23-3 cruises were added in this study. Each sample set consisted of samples collected using three different methods: bucket, intake, and Niskin. Table 1 summarizes the cruises, specifying the sampling months and number of stations, and Fig. 1 shows the station locations. The data were also used to analyze species-specific detection performance across sampling methods in the surface layer (Ahmed et al., 2025, submitted), biogeographic boundaries at the Takara Gap (Inoue et al., 2025, submitted), and fish species composition, including subsurface layers (Lin et al., 2025, submitted).
Distribution of sampling sites in the western North Pacific, including a single site in the Sea of Japan. Different colored circles represent sampling sites from various research cruises: KS-22-15 (October 2022), KS-21-11 (June 2021), KS-21-12 (June-July 2021), KS-21-8 (May 2021), KS-21-3 (March 2021), KS-18-5 (May 2018), KS-22-11 (August 2022), KH-23-3 (July 2023), and KS-21-24 (October 2021). To distinguish the overlapping sampling locations of the KS-21-24 and KS-22-11 cruises, KS-21-24 sites are marked with 'X' symbols, while KS-22-11 sites are indicated with circles. Red arrows highlight warm ocean currents, while blue arrows show cold currents. 'TWC' refers to the Tsugaru Warm Current
Each cruise had distinct research objectives, leading to an uneven distribution of sampling points across various seasons. We joined these cruises to collect samples from a broad range of sites in the western North Pacific, allowing for a comprehensive assessment of marine surface biodiversity and community composition using different eDNA sampling methods under varied environmental conditions.
2.2 Sample collection
At each station, surface water samples were collected using three methods. Niskin bottles (hereafter referred to as “Niskin”) were deployed at depths below 5 or 10 m (depth depending on the season), while surface water was collected at 0 m using a clean bucket (hereafter referred to as “bucket”) dropped directly over the side of the ship. Additionally, water was sampled via the ship’s seawater intake system (hereafter referred to as “intake”), which collects water from approximately 4.5 m below the surface through a designated seawater faucet. Niskin bottles were equipped with a conductivity temperature depth (CTD) system to record various environmental parameters.
At each station, approximately 10 L of water was collected per sample simultaneously using all three methods. The exception was during the KS-18-5 cruise, where intake samples were collected at a different time to Niskin and bucket samples due to underway CTD observation. Here, about 7 L was collected per sample in each method. During the KH-23-3 cruise, samples from all three methods were collected from two stations every 4 h to compensate for the limited number of observation sites.
Prior to sample collection, clean plastic sampling containers were washed three times with the sampled seawater. Each sample was then weighed and filtered using a Sterivex-HV pressure filter unit with a 0.45 μm pore size, following the procedure outlined by Yu et al. (2022). During the KS-18-5 cruise, a 0.22 μm pore-size filter of the same type was used. After filtration, the filter units were filled with 2 mL of RNAlater using a disposable syringe to preserve the DNA and then stored at − 25 °C.
To ensure accurate seawater sampling and assess potential contamination during onboard handling, two negative control samples consisting of 1.5 L of Milli-Q water were collected at the start and end of each cruise. The Milli-Q water was generated on board and filtered following the same procedure used for the seawater samples. The KS-18-5 cruise, conducted in 2018, was among the early applications of eDNA metabarcoding in open ocean studies. At that time, the necessity of including negative controls was not fully recognized, as also highlighted by Sepulveda et al. (2020), and therefore, they were not collected. Filtration took place exclusively in a designated area of the research vessel’s laboratory. All equipment were thoroughly cleaned using a 1% bleach solution and rinsed with Milli-Q water to eliminate any bleach residue before each use, minimizing cross-contamination risk. Fresh gloves and masks were worn during both water collection and filtration to further reduce contamination potential.
2.3 DNA extraction and purification procedures
After samples were transported to land, eDNA extraction and purification for all cruises were performed at the Molecular Marine Biology Laboratory, Atmosphere and Ocean Research Institute, The University of Tokyo, using a slightly modified version of the Miya et al. (2015) protocol, as described by Wong et al. (2020). For the KS-18-5 cruise, extraction followed the original Miya et al. (2015) protocol and was conducted at Bioengineering Lab. Co., Ltd., Kanagawa, Japan. To minimize contamination risk, all equipment, including silicon tubes and connectors, was sterilized with a 1% bleach solution and thoroughly rinsed with Milli-Q water before each use.
The samples were processed in a dedicated eDNA laboratory reserved exclusively for molecular work, with no exposure to other tissue samples or PCR products. Stringent hygiene protocols were observed, including the use of new gloves and masks for each procedural step. The workspace was regularly disinfected with 0.1% bleach spray to maintain a contamination-free environment.
2.4 MiFish eDNA metabarcoding
Following DNA extraction, eDNA metabarcoding was conducted using the MiFish primer, which amplifies a ~ 170 bp fragment of the mitochondrial 12S rRNA gene, widely used for fish species detection due to its broad taxonomic coverage and resolution (Miya et al. 2015). The libraries were prepared via a two-step PCR (Miya et al. 2020), and sequencing was performed using Illumina platforms (HiSeq X, MiSeq, or HiSeq 2500) across cruises. All sequencing data were processed using a standardized pipeline: primers were removed (using DADA2, or Cutadapt for KH-23-3), quality filtering was applied, and a fixed 120 bp segment from the 3′ end of the ~ 170 bp amplicon was retained to reduce sequencing errors and ensure consistent read length across platforms (see Text S1, Supplementary Material 1; Fig. S1, Supplementary Material 2). For all cruises except KS-18-5, amplicon sequence variants (ASVs) were inferred using the DADA2 (or Cutadapt for KH-23-3) plug-in in QIIME2 (Bolyen et al. 2019). For the KS-18-5 cruise, sequencing data were obtained from Yu et al. (2022; Tables S4 and S5 of their manuscript), in which the reads had been clustered into operational taxonomic units (OTUs) based on a 97% sequence similarity threshold.
ASVs and OTUs were taxonomically assigned based on BLASTN searches (ver. 2.12.0–2.13.0; Altschul et al. 1990) against the MitoFish mitochondrial genome database (ver. 3.72; Sato et al. 2018b) and MiFish reference sequences (Miya et al. 2015), using thresholds of ≥ 97% sequence identity and query coverage. When multiple candidate species met these thresholds, the species with the longest alignment length (i.e., the highest number of matching bases) was selected. If multiple candidates shared the same alignment length, the sequence was assigned to a higher taxonomic rank (e.g., genus or family). Despite the use of different taxonomic units (OTUs vs. ASVs), station-wise comparisons among sampling methods (bucket, intake, and Niskin) were unaffected, as taxonomic assignments were consistent within stations across methods. The average number of quality-filtered reads per sample ranged from ~ 78,000 (KS-18-5) to ~ 591,000 (KS-21-24) across cruises. Detailed raw and quality-filtered read counts for all individual samples, along with the average values for each cruise, are presented in Table S1 (Supplementary Material 3).
In eDNA metabarcoding, a ceiling effect may occur when a few highly abundant taxa dominate the sequencing output, thereby masking the detection of less abundant species. Furthermore, higher read counts of a species in one sample compared to another do not necessarily reflect greater biological abundance—particularly for non-dominant taxa—due to potential variation in sequencing depth and amplification efficiency (Skelton et al. 2023). Similarly, a sample with more total reads does not inherently contain more species than one with fewer reads (Miya et al. 2015). To account for these biases, raw read counts were normalized to relative read abundance (RRA), expressed as the percentage of total reads within each sample. This normalization enables more meaningful and standardized comparisons of species composition across samples. To further improve detection accuracy, we applied a threshold of 0.5% RRA per sample to confirm species presence/absence. This 0.5% threshold was determined through preliminary visual inspection of species count distributions across various cutoffs (Fig. S2, Supplementary Material 2), which showed a marked decline in detected species at the 0.5% level, while increasing the threshold to 1% led to only a marginal reduction. These patterns indicate that a 0.5% cutoff effectively reduces false positives without substantially discarding true detections. Similar thresholds have been employed in previous eDNA metabarcoding studies to improve detection reliability and minimize erroneous identifications (Drake et al. 2022; Jackman et al. 2021).
The results of the onboard negative controls are summarized in Table S2 (Supplementary Material 3). In most cruises, no fish species were detected in these control samples, with the exception of KS-22-11, KS-21-8, and KS-22-15. During the KS-22-15 cruise, Alburnus alburnus (Common Bleak), a freshwater species, was detected exclusively in the negative control and not in any of the seawater samples, indicating a likely false positive. In contrast, Zacco platypus (Pale Chub), another freshwater species (Froese and Pauly 2024), was detected in the KS-22-11 cruise negative control sample as well as in seawater samples from that and some other cruises. Given its ecological improbability in open marine environments, Zacco platypus was entirely excluded from the dataset. All other species detected in the negative controls were either absent from the seawater samples or appeared at very low read counts. As described above, RRA values were used to account for sequencing bias, and a threshold of > 0.5% per sample was applied to determine species presence. Consequently, species with low read counts in the negative controls were automatically excluded due to their low RRA values.
2.5 Collection of environmental data
Environmental parameters—including seawater temperature, salinity, and chlorophyll-a concentration—were recorded systematically at 1 dbar intervals using a CTD system during seawater sampling. To reduce instrument noise, seawater temperature and salinity data were processed using a Gaussian filter (Gaussian sigma = 2.5 db, width = 11 db) using the “scipy.ndimage.gaussian_filter1d” function in Python 3.10.9 (Python Software Foundation, 2023). Potential density was calculated using the Gibbs Seawater Oceanographic package with “gsw.density.sigma.” Maximum depth was recorded at each sampling site. Mixed layer depth (MLD) was determined using a finite difference method based on potential density at 10 dbar, with a density threshold of Δσθ = 0.125 kg/m3, following Kara et al. (2000). Seasons were defined by sampling months, as outlined by Trusenkova and Ishida (2005). Sample collection time (day or night) was identified using local sunset and sunrise times, calculated via the “datetime” and “astral.sun” functions in Python. A comprehensive summary of all environmental data is provided in Table S3 (Supplementary Material 3).
2.6 Statistical analysis
Taxonomic differences, as proxied by relative frequency and relative family-level composition (proportional representation of species- and family-presence), were visually compared across sampling methods using “ggplot2” (Wickham 2016) in the statistical software R (version 4.3.1) accessed via R Studio (version 2024.09.1 + 394; R Core Team 2023). The relative frequency for each sampling method was determined by dividing the number of occurrences of each species by the total number of occurrences across all species detected within that method. Similarly, relative family composition percentage was calculated as the proportion of occurrences of each family relative to the total occurrences across all families within the same sampling method.
Rarefaction curves helped to evaluate sampling completeness and compare asymptotic species richness and biodiversity among methods. Asymptotic estimates, with 95% confidence intervals, were calculated using R’s “iNEXT” package (Hsieh et al. 2016). Hill's numbers (q = 0, q = 1, and q = 2) were utilized for these comparisons based on incidence frequency data, where the input reflects the number of samples in which each species was detected within each method. Hill’s numbers quantify different aspects of species diversity: q = 0 for species richness, q = 1 for effective species number (exponential of Shannon entropy), and q = 2 for the inverse Simpson index, reflecting species evenness and frequency. Equations for Hill’s numbers are provided in Equations S1, S2, and S3 (Supplementary Material 4).
We compared the differences in each diversity index (q = 0, q = 1, q = 2) among methods using a non-parametric Kruskal–Wallis test, followed by pairwise Wilcoxon rank-sum tests for post hoc comparisons. Statistical significance was set at P < 0.05.
Pairwise comparisons revealed significant differences in the q = 2 diversity index across sampling methods based on pooled species data (see resultes). We, therefore, performed cluster analysis, using the “cluster” package (Maechler et al. 2023), to categorize sampling sites based on their species composition. The optimal number of clusters was determined through elbow and silhouette methods. Clusters were mapped using the Python library “cartopy” (https://github.com/SciTools/cartopy), and species compositions within clusters were visualized with the R package “pheatmap” (Kolde 2019). Indicator species analysis was performed using the multipatt function from the “indicspecies” package in R (De Cáceres et al. 2010) to identify species that are significantly associated with specific clusters. Alpha diversity indices (Shannon and Simpson) were also compared among methods within each cluster using the Kruskal–Wallis test (α = 0.05), followed by Wilcoxon tests for pairwise comparisons.
The spatial distribution of each cluster was further analyzed with non-metric multidimensional scaling (nMDS) using Jaccard dissimilarity. We visually explored relationships between clusters and underlying environmental variables using the ‘envfit’ function from the “vegan” package (Oksanen et al. 2022). We used a permutational multivariate analysis of variance (PERMANOVA), via the “adonis” function, also from “vegan” package with statistical significance set at P < 0.05 and 999 permutations to formally assess how much variation in fish community composition could be explained by the predictor variables of sampling method, season, cluster (i.e., region), time of sampling (day/night), and environmental variables (chlorophyll-a concentration, MLD, and temperature). Multicollinearity among environmental variables was assessed using the variance inflation factor (VIF) method with “calculate_vif” tool in Python 3.10.9. Potential density was excluded due to high VIF values, indicating multicollinearity with other predictor variables. The remaining variables showed low VIF values (see Table S4, Supplementary Material 3), confirming their suitability for analysis.
3 Results
3.1 Taxonomic richness
Our eDNA metabarcoding analysis from three surface sampling methods (bucket, intake, and Niskin) collectively detected 465 different fish taxa across 83 sampling stations, with each method individually identifying at least 324 species. The relative frequencies of fish taxa across the three methods were consistent (Fig. 2a). The dominant families identified—Carangidae, Clupeidae, Cottidae, Cyprinidae, Exocoetidae, Gobiidae, Gonostomatidae, Labridae, Mullidae, Myctophidae, Salmonidae, Scombridae, Serranidae, Tetraodontidae, Monacanthidae, and others—were detected across all methods, with relative composition percentages being nearly identical (Fig. 2b).
Comparison of taxonomic richness and family composition across three sampling methods (bucket, intake, and Niskin). a Relative frequency of the top 50 fish species detected by each method, with other species grouped as 'Others'. b Relative composition of the top 15 fish families observed using each method, with all other families grouped as 'Others'
3.2 Hill’s diversity indices across eDNA sampling methods
Hill’s numbers (q = 0, q = 1, q = 2) were analyzed to characterize marine surface fish diversity across the three eDNA sampling methods (bucket, intake, and Niskin) as a function of sampling effort (Fig. 3A–C). Species richness (q = 0), representing the total number of detected fish species, did not show significant differences among the sampling methods (P > 0.05) (Fig. 3A). Similarly, the Shannon index (q = 1), which accounts for both species detection and evenness using the exponential of Shannon entropy, showed no significant variation across methods (p > 0.05) (Fig. 3B). However, the inverse Simpson index (q = 2), which gives more weight to the proportion of frequently detected species, showed a significantly higher value in the bucket samples (p < 0.05). In contrast, the intake and Niskin samples did not differ significantly (p > 0.05) (Fig. 3C). Details of the statistical tests are provided in Tables S5a and S6a (Supplementary Material 3).
Rarefaction and extrapolation curves for Hill’s diversity indices (q = 0, q = 1, and q = 2) across three eDNA sampling methods (bucket, intake, and Niskin) for marine surface fish diversity. A Richness (q0) represents species richness, indicating the total number of detected species. B Shannon index (q1) accounts for the exponential of Shannon entropy. C The inverse Simpson index (q2) emphasizes species evenness. Solid lines represent rarefaction estimates, and dashed lines represent extrapolation beyond observed data. Shaded areas indicate 95% confidence intervals for each sampling method
To address the bioinformatics inconsistency in taxonomic assignment, we excluded data from the KS-18-5 cruise which used a different filter (0.22 µm) and OTU approaches instead of ASVs used elsewhere. Despite excluding this data, we still observed a higher inverse Simpson index (q = 2) in bucket samples (Fig. S3, Supplementary Material 2; Tables S5b and S6b, Supplementary Material 3). We then analyzed Niskin samples collected at the same time at two very close depths (5 m and 10 m) during the KS-18-5 cruise. No significant differences were observed in species richness (q = 0), Shannon index (q = 1), or Inverse Simpson index (q = 2) between the 5 m and 10 m depths (P > 0.05) for this cruise (Fig. S4, Supplementary Material 2). In addition, both depths shared 32 of the 52 detected species. These results suggest that depth within this range did not significantly affect biodiversity indices or species detection in the Niskin samples collected during the KS-18–5 cruise.
3.3 Fish community composition of marine surface
To further investigate the fish community composition in the marine surface environment, we conducted a cluster analysis using Ward’s linkage method and Jaccard distances based on presence/absence data. This analysis revealed eight distinct clusters across all samples collected from the three sampling methods (bucket, intake, and Niskin) (Fig. 4). Mapping the cluster positions for each method showed that seven of the eight clusters were consistently detected across all methods, with similar overall distributions (Fig. 5). However, samples collected using the intake method were not included in Cluster 2. Stations identified as belonging to Cluster 2 in the bucket and Niskin samples were replaced by Cluster 5 in the intake samples for those stations. Additionally, one Niskin sample is also classified as Cluster 5 in place of Cluster 2 (Fig. 5). To further examine method-specific patterns, we conducted cluster analyses independently for each sampling method (bucket, intake, and Niskin) using Jaccard distances. Each method yielded six clusters, named according to their order of appearance in the dendrograms (Fig. S5a–c, Supplementary Material 2). Clusters representing corresponding groups of sampling sites—based on geographical proximity—were generally consistent across methods (Fig. S6, Supplementary Material 2).
Cluster dendrogram of fish community composition in the marine surface environment, based on presence/absence data from bucket, intake, and Niskin sampling methods. The analysis was conducted using Ward’s linkage method with Jaccard distances, revealing eight distinct clusters, each highlighted by a different colored box. Branch colors represent sampling methods: blue for bucket, pink for intake, and green for Niskin, corresponding to the stations sampled with these methods
Spatial distribution of identified fish species clusters across the three eDNA sampling methods: a bucket, b intake, and c Niskin. Each map illustrates the geographic positions of the different species clusters (1–8) identified using hierarchical clustering with Ward linkage based on Jaccard similarity. The symbols and colors represent distinct clusters. Notably, Cluster 2 is absent in the intake method. Caution is advised as some stations have very close or almost identical positions, leading to overlapping symbols (e.g., Cluster 1 and Cluster 6)
Alpha diversity indices (Shannon and Simpson) were largely similar among methods within each cluster (Kruskal–Wallis test followed by pairwise Wilcoxon test), except for Clusters 5, 6, and 7. In Cluster 5, bucket samples had significantly higher biodiversity than both intake samples (p < 0.01) and Niskin samples (p < 0.05). Similarly, in Cluster 7, bucket samples exhibited slightly higher biodiversity than Niskin samples (p < 0.05) for both indices, with no significant difference between bucket and intake samples. In Cluster 6, bucket samples exhibited slightly higher biodiversity than Niskin samples (p < 0.05) for the Shannon index only. In all of Clusters 5, 6, and 7, biodiversity indices did not significantly differ between Niskin and intake samples (Fig. 6). After excluding the KS-18-5 cruise to minimize inconsistencies related to bioinformatics processing and filter type, we identified eight clusters (see dendrogram in Fig. S7a, Supplementary Material 2). Similar to the results from the full dataset including all cruises, both biodiversity indices did not differ significantly among methods in most clusters (P > 0.05, Kruskal–Wallis test followed by pairwise Wilcoxon test), except in Clusters 2a, 6a, and 8a (Fig. S7b, Supplementary Material 2). These three clusters (2a, 6a, and 8a), which showed significant differences (P < 0.05) in biodiversity indices among sampling methods after excluding the KS-18-5 cruise, were located in the same geographical regions (Fig. S8, Supplementary Material 2) as the previously defined Clusters 5, 7, and 6, respectively, where similar differences in biodiversity indices were observed based on data from all cruises (Fig. 5).
Boxplots of alpha diversity indices (a. Shannon and b. Simpson) for each cluster, comparing the bucket, intake, and Niskin sampling methods. Kruskal–Wallis and post-hoc Wilcoxon tests were applied to determine significant differences between methods. NS: not significant. Significance levels: *P < 0.05; **P < 0.01
Cluster-wise rarefaction and extrapolation curves of Hill numbers (q = 0, 1, and 2) also indicated no significant differences among the three sampling methods—bucket, intake, and Niskin—in most clusters (Fig. S9a and S9b, Supplementary Material 2; Table S7, Supplementary Material 3). However, some clusters exhibited method-specific differences. For example, in Cluster 5, bucket samples showed higher diversity estimates than both intake and Niskin samples for q = 1 and q = 2 (P < 0.05), while in Cluster 7, bucket samples showed higher diversity than Niskin samples for q = 1 only (P < 0.05). These results are highly consistent with the alpha biodiversity findings, where Shannon and Simpson indices also showed significantly higher values in Bucket samples within the same clusters. Species richness (q = 0) varied significantly in Cluster 1 (intake > Niskin), Cluster 6 (bucket and intake > Niskin), and Cluster 7, where all three methods differed significantly from each other. No significant differences in Hill numbers were observed among methods in the remaining clusters (P > 0.05; Table S7, Supplementary Material 3).
Each cluster was characterized by the presence of one or more indicator species, which help define and distinguish the clusters from one another (Table 2). A heatmap showing the detection frequencies of the top 10 most frequently detected species in each cluster is presented in Fig. 7. The geographical distribution of the clusters revealed by eDNA metabarcoding data from the bucket, intake, and Niskin methods showed that the clusters were associated with different regions. Cluster 1 was located in the Oyashio region (subarctic gyre). Cluster 2 was along the Kuroshio Extension. Clusters 3 and 4 were in the coastal region of the northern part of the mixed water region. Cluster 5 was widespread along the coastal region of the southern part of the mixed water region, as well as the Pacific coast of the Kuroshio region. Cluster 8 was associated with the Kuroshio Current and Tsushima Current, and Cluster 7 was located in the East China Sea (Fig. 5). Clusters 2 and 5 are geographically close, with the latter replacing the former when considering biodiversity patterns as revealed by just the intake method (Fig. 5).
Heatmap showing the species composition across the eight clusters, illustrating the detection frequency of the top 10 most detected species in each cluster. The color gradient represents the number of times each species was detected within each cluster, with blue indicating low detection frequency and red indicating high detection frequency. Species not in the top 10 for each cluster are grouped under "Others."
The PERMANOVA analysis revealed that multiple factors contributed to variation in fish community composition, with the highest variation explained by cluster (region) at 25.4%, followed by season at 11.3%. In contrast, the effect of method accounted for only 1.4% of the variation, followed by the influence of temperature (1.08%) (Table 3). However, as the cluster (region) variable was derived a priori from the same species matrix, we also tested a model excluding this factor. Notably, the variation explained by method remained unchanged at 1.4% (Table S8, Supplementary Material 3).
3.4 Influence of environmental parameters
Non-metric multidimensional scaling (nMDS) was conducted to visualize patterns in species composition across different sampling methods (bucket, intake, and Niskin) using the Jaccard dissimilarity index. The nMDS analysis produced a two-dimensional solution with a stress value of 0.164, effectively distinguishing the ordination of the eight identified clusters (Fig. 8). When ordinations were performed independently for each method, the overall patterns were broadly consistent, with minor differences likely reflecting variations in stress values: Bucket (0.159), Intake (0.162), and Niskin (0.116) (Fig. S10, Supplementary Material 2).
nMDS ordination plot (Stress = 0.164) illustrating cluster patterns across three sampling methods (bucket, intake, and Niskin) using the Jaccard dissimilarity index. Points represent samples, colored by method (blue for bucket, pink for intake, and green for Niskin), and shaped according to the identified clusters (1–8). Ellipses denote the dispersion of each cluster at a 95% confidence interval. Significant continuous environmental parameters (temperature, chlorophyll-a (Chl_a), and salinity) are displayed as vectors, and the centroid positions of significant seasonal categories (winter, spring, summer, and fall) are shown as orange points with labels. These vectors and centroids illustrate the influence of environmental variables on the clusters and their separation in the ordination space
To evaluate the influence of environmental variables on community composition, we applied the “envfit” function to fit both continuous variables (temperature, salinity, chlorophyll-a, mixed layer depth, and maximum depth at sampling sites) and categorical variables (season and time of day for sample collection) onto the nMDS ordination.
The continuous variables that showed significant correlations with the ordination were temperature (R2 = 0.59, P < 0.001), salinity (R2 = 0.58, P < 0.001), and chlorophyll-a (R2 = 0.26, P < 0.001) (Table S9, Supplementary Material 3). Among the categorical variables, “season” demonstrated a significant association (R2 = 0.26, p < 0.001). To enhance clarity, only significant parameters are displayed on the nMDS plot (Fig. 8). These findings suggest that the clusters may be influenced by both environmental factors (continuous and seasonal) as well as geographical position. Specifically, Cluster 1 was associated with lower salinity; Cluster 2 with higher salinity and the spring season; Clusters 3 and 4 with elevated chlorophyll-a levels; Cluster 5 with a combination of multiple variables (cooler temperature, higher Chlorophyll-a and salinity during winter and spring) distributed across a larger area on the ordination plot; Cluster 6 with higher temperatures; and Clusters 7 and 8 with favorable temperatures, differentiated by the fall and summer seasons, respectively (Fig. 8).
4 Discussion
4.1 Assessment of marine surface fish diversity in the western north pacific
eDNA metabarcoding detects more species than traditional methods like trawling, offering a broader, more comprehensive view of fish diversity (He et al. 2023; Fraija-Fernández et al. 2020). This technique has proven to be a reliable tool for continuous biodiversity monitoring (Stoeckle et al. 2021). It also holds promise in assessing threatened species’ distributions and population trends over time, which is essential for effective conservation and management (Zou et al. 2020). In this study, we identified 465 species across three sampling methods. Most species’ distributions were successfully cross-referenced with FishBase (Froese and Pauly 2024; Boettiger et al. 2012) (Table S10, Supplementary Material 3), arranged by Nelson’s Fishes of the World (5th edition) (Nelson et al. 2016). The dominant families identified align with previous studies (Orlov and Tokranov 2019; Kawakami et al. 2023), with Myctophidae as the most abundant. Our study also detected rare species—those found at very low frequencies across samples—many of which are typically challenging to capture with traditional methods (Stat et al. 2019). This is supported by the observation that the top 50 most frequently detected taxa accounted for more than 50% of total detections across all three methods, despite a total of 465 taxa being identified (Fig. S11a, b, Supplementary Material 2), highlighting eDNA’s potential to uncover hidden biodiversity.
A key finding of our study is that each of the three sampling methods (bucket, intake, and Niskin) yielded similar taxonomic identifications, with each method detecting at least 324 taxa from 83 sampled stations and over 480 asymptotic counts based on rarefaction and exploration analyses (Table S6a, Supplementary Material 3). The consistency in taxa and families across the sampling methods is likely due to the filtration of a substantial volume of water (~ 10 L). This volume may be sufficient to detect most fish species, but for more accurate and comprehensive species detection, larger volumes (e.g., > 100 L) may be required (Kawakami et al. 2023). While larger volumes generally improve species detection, the benefit plateaus beyond a certain threshold (Cantera et al. 2019). Ahn et al. (2022) observed that, in their winter trial, a single 10-L sample detected more OTUs than five combined 2-L replicates, highlighting the potential benefit of using sufficient volume in a single sampling event. However, they cautioned that this result may reflect the sparse distribution of eDNA rather than a consistent advantage of larger volume. Similarly, Kawakami et al. (2023) reported that smaller volumes, such as 3L, require tens of replicates to reliably capture the fish community in surface waters, or a greater total filtration volume is needed for more accurate results. For deep ocean metabarcoding, Yoshida et al. (2023) found that 10–20L is optimal for biodiversity monitoring, but more than 10 replicates are necessary for robust species detection. Although we did not directly compare variation arising from filtration volume and sampling method, the observed consistency in taxa across methods suggests that filtering a sufficiently large volume (e.g., ~ 10 L) may help minimize variability in species detection among methods. Therefore, filtering as much water as possible—ideally with multiple replicates—remains essential for enhancing detection reliability within logistical constraints.
Since eDNA can decay, disperse, and diffuse in marine environments, a minimum species abundance is required to shed enough DNA to enable reliable detection (Barnes and Turner 2016). Although many of the detected species in this study are challenging to identify using traditional methods, their widespread distribution in the study area (Fujikura et al. 2010) likely contributed to the minimal differences in taxonomic identification among the sampling methods. Overall, our results—particularly the similar taxonomic profiles and consistent rarefaction trends—suggest that each of the three sampling methods (bucket, intake, and Niskin) is effective for detecting surface marine fish taxa and can be reliably employed in marine surface ecosystems.
4.2 Geographical distribution and composition of marine surface community
The geographical distribution of fish species clusters, as identified through eDNA metabarcoding using bucket, intake, and Niskin sampling methods, revealed highly consistent clustering patterns across all three methods, reflecting distinct biogeographic regions. Based on our results, the sampling stations were broadly divided into two major regions, as indicated by the clustering structure in the dendrogram (Fig. 4), with the separation occurring around 34°N, as observed in the latitudinal gradient of the spatial distribution map (Fig. 5).
South of this boundary, three clusters—Clusters 6, 7, and 8—were consistently identified by all methods, demonstrating the robustness of eDNA metabarcoding in capturing spatial biodiversity patterns (West et al. 2020). Cluster 8, situated along the western coast of Honshu and influenced by the Kuroshio Current, likely supports fish communities adapted to warm, saline waters (Yu et al. 2023; Zhang et al. 2020). Cluster 7, associated with the East China Sea region, is also under the influence of the Kuroshio Current, highlighting the role of warm, subtropical waters in shaping regional biodiversity (Guo et al. 2006). Cluster 6, located in the offshore open ocean, represents a pelagic environment supporting diverse marine fish species (Fujikura et al. 2010).
North of the 34°N boundary, five clusters were identified: Cluster 1 in the sub-Arctic region and Clusters 2, 3, 4, and 5 along the Sanriku coast. These clusters were also consistently detected across all three sampling methods, with the exception of Cluster 2, which was not included in the intake samples. Cluster 1, shaped by the cold, nutrient-rich Oyashio Current, reflects a sub-Arctic environment that supports species adapted to low temperatures and high productivity (Ishak et al. 2020). Clusters 3 and 4, influenced by the convergence of the Kuroshio and Oyashio currents, are characterized by intermediate temperature and salinity conditions that promote nutrient availability and support species adapted to dynamic environments (Yasuda 2003; Yamamoto et al. 2017). Cluster 2, influenced by the warm, saline waters of the Kuroshio Current, may harbor fish species specific to Kuroshio-dominated systems. Cluster 5, which spans a wide area along the eastern coast, exhibits similar hydrographic conditions and likewise reflects the Kuroshio’s influence on fish community composition (Yu et al. 2022) (Fig. 5).
These regional and cluster-specific biodiversity patterns are further supported by nMDS analyses incorporating environmental variables (Fig. 8), reinforcing the ecological relevance and robustness of the clustering derived from eDNA metabarcoding (Jeunen et al. 2019). With only minor discrepancies in a few assignments across the three methods, the convergence of biogeographic patterns underscores the effectiveness of each of the bucket, intake, and Niskin sampling methods in capturing spatial biodiversity via eDNA metabarcoding. Furthermore, the correspondence between cluster distribution and major ocean currents suggests that physical and oceanographic features surrounding Japan play a key role in shaping distinct ecological zones (Ashjian et al. 2006).
The eight distinct clusters of fish communities, each characterized by indicator species and environmental factors, highlight eDNA metabarcoding’s ability to distinguish localized fish assemblages (Yamamoto et al. 2017; Jeunen et al. 2019). Key factors such as temperature, chlorophyll-a, salinity, and seasonality likely contribute to these clusters. As ectotherms, fish need to remain within their optimal temperature range; chlorophyll-a indicates areas of high fish biomass (Sato et al. 2021); salinity correlates with eDNA concentrations and species’ habitat preferences (Takahara et al. 2019); and seasonal sampling enhances eDNA detection during breeding or migration (Hayami et al. 2020; Inui et al. 2021).
The clusters varied in their responses to environmental variables. For example, Cluster 1, which included several Salmonidae species, reflected the geographical distribution characteristics of salmonids and their preference for regions with lower salinity (Sato et al. 2018a) and lower temperatures (Stehfest et al. 2017), although these environmental preferences can vary depending on developmental stage. Clusters 3 and 4 included species such as Repomucenus planus, Sardinella zunasi, Laemonema longipes, Lateolabrax japonicus, and Leuroglossus schmidti, which generally feed on zooplankton and crustaceans. These food sources are directly linked to primary productivity, as zooplankton and crustacean abundance increases in environments with high chlorophyll-a levels. Consequently, these fish species thrive in such conditions (Lowerre-Barbieri et al. 2017; Sato et al. 2021). Cluster 5 is characterized by Myctophum spinosum, a species commonly found in tropical and subtropical waters and often appearing in the bycatch of deep-sea trawlers. These fish are opportunistic carnivores, feeding on crustaceans and small fishes, and constitute a significant portion of the mesopelagic zone's biomass (Vipin et al. 2015). Additionally, cluster 5 is dominated by sardines, anchovies, and mackerel, highlighting the ecological plasticity of these species. Their ability to thrive in diverse environments is likely driven by factors such as temperature and chlorophyll-a availability (Froese and Pauly 2024). Cluster 6, characterized by Katsuwonus pelamis and Oxyporhamphus micropterus, represents species highly adapted to pelagic, warmer waters and specific salinity gradients, reflecting their reliance on specific oceanographic conditions (Collette and Nauen 1983; Froese and Pauly 2024).
Clusters 7 and 8, composed of various lanternfish and other mesopelagic fish, differ not only in their geographical locations but also in their seasonal associations. Cluster 7 aligns with fall, while Cluster 8 aligns with summer, reflecting seasonal, life-history-dependent differences in exploiting periods of high biological activity (Hayami et al. 2020; Inui et al. 2021).
Overall, the spatial and seasonal environmental associations of each cluster are consistent with previous knowledge of species’ ecology and life history and emphasize the importance of accounting for both spatial and temporal variation in marine biodiversity assessments. In our study, spatial structuring (Cluster) and seasonal variation emerged as the dominant contributors to fish community dissimilarity, while only minor differences were attributed to sampling method. These method-specific differences likely stemmed from very specific presence/absence cases involving certain species, such as Pacific Saury (Cololabis saira) (Ahmed et al., 2025, submitted). Thus, our observed biodiversity patterns mostly likely reflect true spatio-temporal patterns rather than methodological artifacts. Further supporting this assertion was that all our species clusters were consistently detected across the bucket, intake, and Niskin sampling methods, except for Cluster 2, which was absent in the intake sampling. Likewise, negligible differences were generally observed in Shannon and Simpson diversity indices estimated using data from different sampling methods within clusters.
In shallow, well-mixed marine areas, eDNA concentrations and estimated community composition can be similar between surface and bottom samples. This pattern has been attributed to the likely minimal vertical stratification of eDNA in such environments (Andruszkiewicz et al. 2017). Since all our collected samples were taken within the mixed layer (Table S3, Supplementary Material 3), any observed community differences for specific clusters or methods are unlikely the result of differences in sampling depth. This interpretation is supported by our nMDS analysis (Fig. 8) and PERMANOVA results (Table 3). In addition, the comparison between 5 and 10 m Niskin samples also supports the interpretation (Fig. S4, Supplementary Material 2).
4.3 Differences among methods
While all three sampling methods demonstrated comparable taxonomic richness and community composition, the bucket samples showed higher inverse Simpson index values and biodiversity indices, particularly in Clusters 5 and 7, than Niskin and intake samples. This difference reflects a greater evenness in bucket samples, where the proportion of frequently detected species was lower than in intake and Niskin samples (Fig. S11a, b, Supplementary Material 2). Two factors may explain this increased evenness in bucket samples.
First, transparent exopolymer particles (TEPs), which are abundant in the sea surface microlayer, may influence eDNA capture and detection. Some TEPs are lighter than seawater and tend to accumulate at the surface, potentially trapping eDNA into a microlayer (Wurl et al. 2011). This accumulation is influenced by factors such as wind speed, stratification, and regional productivity (Zamanillo et al. 2019; Engel and Galgani 2016). Due to their high stickiness, TEPs aggregate with other organic and inorganic particles. Furthermore, their lighter density allows TEP-rich aggregates to remain suspended or even ascend, enabling them to remain in the surface microlayer unless ballasted by denser particles (Mari et al. 2017; Wurl et al. 2009). This TEP-enriched surface microlayer enhances light scattering, traps and aggregates particulates, and could thus facilitate the accumulation of eDNA in the water column, potentially increasing its detectability (Stramski et al. 2019). As a result, the TEP-enriched surface microlayer may serve as a reservoir for genetic material from various sources, including rare and less abundant species. Consequently, the aggregation and retention of eDNA in the surface microlayer—driven by TEPs (Wurl and Holmes 2008; Thornton 2014) and less accessible to deeper sampling methods like intake and Niskin—may lead to the detection of more species in bucket samples, which likely explains the higher evenness observed in those samples. However, compared to the vast volume of the global ocean, the surface microlayer is extremely limited in total volume (Cunliffe et al. 2013) and is primarily formed under weak to moderate wind conditions and the presence of surfactants (both natural and anthropogenic) (Wurl and Holmes 2008; Wurl et al. 2011). Therefore, despite our research being conducted in the dynamic Kuroshio-Oyashio system, intake sampling can be applied globally due to the limited presence of the microlayer in the ocean, which leads to better evenness and biodiversity in bucket samples.
Second, a ceiling effect related to high eDNA read counts may influence metabarcoding results (Skelton et al. 2023). UV radiation degrades eDNA more rapidly in surface layers, reducing the eDNA pool density, especially for species that release genetic material in these upper layers (Strickler et al. 2015). This degradation may mitigate the ceiling effect by lowering the read counts for frequently detected species, thereby allowing rarer species to be detected in bucket samples (Fig. S11a and S11b, Supplementary Material 2). In contrast, samples collected using intake and Niskin methods are less exposed to UV, which may lead to a more pronounced ceiling effect, favoring the prevalence of frequently detected species (Lamb et al. 2019). Consequently, bucket samples tend to exhibit higher evenness and potentially a higher inverse Simpson index.
Filter pore size is also an important consideration in fish eDNA metabarcoding studies, as it can influence the number and diversity of species detected (Li et al. 2018). Filters with a 0.45 μm pore size are consistently used in fish eDNA metabarcoding studies (Roblet et al. 2024; Li et al. 2018). Larger filter pore sizes (e.g., 0.45 μm) do not necessarily decrease the yields of eDNA (Jo et al. 2020), while 0.22 μm filters may improve the completeness of fish species identification in situations when the DNA is degraded (Kawakami et al. 2023). In our study, we conducted separate analyses by excluding data from the KS-18–5 cruise, which used a different filter size (0.22 μm) compared to the other cruises (0.45 μm). Our results revealed similar trends in fish biodiversity and community composition estimates, regardless of whether we included all cruise data or excluded the KS-18–5 data.
In coastal and open ocean systems, eDNA decay rates are key factors in interpreting eDNA signals (Pastor Rollan et al. 2024; Andruszkiewicz et al. 2019). These decay processes are influenced by several environmental factors, including temperature and water chemistry (Jo and Minamoto 2020; Sassoubre et al. 2016), and potentially sunlight—although some studies have reported no significant impact of light exposure on eDNA degradation (Andruszkiewicz et al. 2017). Additionally, the ecology of eDNA—its state and characteristics—in the open ocean remains largely unknown. Therefore, the differences observed among sampling methods (for example, higher biodiversity in bucket samples in certain regions) may be attributed to the physical characteristics (e.g., DNA buoyancy, particle size) and state of eDNA in the water column (Jo and Minamoto 2020).
Another notable difference was the absence of Cluster 2 in the intake samples, despite its presence in bucket and Niskin methods. A closer examination of the sampling stations revealed that intake samples replaced Cluster 2 with Cluster 5 (and one Niskin sample also showed this replacement) (Fig. 5). All Cluster 2 samples were collected during the KS-18–5 cruise where the intake samples were collected at a different time of day compared to bucket and Niskin methods (which were collected simultaneously) (see Fig. S12, Supplementary Material 2). Temporal differences in eDNA sampling may result in biases associated with inter-species differences in diurnal activity (Suzuki et al. 2022) or observational biases due to ocean advection (Yu et al. 2022). All intake samples, as well as the single Niskin sample identified as Cluster 5, were collected during the daytime.
Further examination of the KS-18-5 intake samples and the Niskin sample classified as Cluster 5 (hereafter referred to as Cluster 5_I_N) revealed that their species composition closely resembled the overall species profile of Cluster 5 (Fig. S13a, b; Supplementary Material 2). In contrast, Cluster 2 was characterized by Chlorurus sordidus (see Fig. S13a, Supplementary Material 2); Table S11 (Supplementary Material 3)), a coastal species likely transported offshore by the Kuroshio Current (Yoo et al. 2004) and not detected in all samples.
Myctophum spinosum—the indicator species for Cluster 5—was likewise identified as the indicator species for Cluster 5_I_N (Table S11, Supplementary Material 3), accompanied by other Myctophidae species (Fig. S13a, b; Supplementary Material 2). These mesopelagic fishes are known for their nocturnal vertical migrations to surface waters for feeding (Watanabe et al. 2002). However, this study revealed higher occurrences of detection of these species in daytime samples compared to nighttime samples during the KS-18-5 cruise which may result from eDNA persistence from their nocturnal activity (Barnes and Turner 2016) or local behavioral variations (Giske et al. 1998). When comparing day and night eDNA detections in the KS-18-5 cruise, we did not observe strong variation between the two, except for Myctophum spinosum and Sardinops melanostictus/S. sagax, which were better detected (P < 0.05) during the day (Fig. S14, Supplementary Material 2). To fully understand potential diel vertical migrations in fish behavior, samples should be collected at regular intervals (e.g., every 4 h). Future research should be conducted to investigate this phenomenon in greater detail. Owing to the time difference in sampling between the KS-18-5 intake and other methods, we excluded these KS-18-5 samples from biodiversity rarefaction analysis for q = 2, as bucket samples had significantly higher q = 2 values. Nonetheless, bucket samples maintained higher diversity than intake and Niskin (P < 0.05) (see Fig. S3, Supplementary Material 2), indicating that the discrepancies in cluster classification and the higher biodiversity index in bucket samples are attributable to different underlying factors, as discussed above. These differences mainly stem from environmental conditions and sampling timing, rather than sampling efficiency.
A key limitation of this study is the lack of replicates within each sampling method, which prevents a full assessment of the variability within and between methods. As eDNA results can vary significantly even among concurrent samples from the same location, replicates are crucial for capturing a comprehensive range of species. While studies recommend using 7–8 replicates and filtering more than 100 L of water to adequately capture fish diversity (Kawakami et al. 2023; Stauffer et al. 2021), implementing this ideal design was not feasible during the cruises due to logistical constraints (e.g., limited space, restricted ship stationary time at each station, and limitations in storage and filtration capacity). Instead, we analyzed a large dataset of concurrently collected water samples across three methods, compiled from multiple cruises conducted under varying oceanographic conditions to enable robust comparisons among sampling methods. This study design reflects a pragmatic balance between methodological rigor and the logistical realities of large-scale marine surveys, contributing to future efforts aimed at optimizing open ocean eDNA sampling protocols. Additionally, variations in the plumbing systems used for seawater intake across ships may introduce unintended methodological variation.
4.4 Practical recommendations
Although the three sampling methods demonstrated comparable performance in assessing marine surface fish community composition and biodiversity, it is important to compare their practical trade-offs when selecting a method. Bucket sampling may provide a better representation of surface communities in the surface microlayer, but it requires the vessel to stop and involves manual, labor-intensive water collection (Miyata et al. 2022). Intake sampling, in contrast, is less disruptive to ship operations (Kawakami et al. 2023) and is widely accessible across various global ocean vessels worldwide (Hoshina et al. 2018; Montoto-Martínez et al. 2020). However, it poses a higher risk of contamination, particularly from heavy metals in ship plumbing systems (Greco et al. 2022). The Niskin method, while also requiring the ship to stop (Yu et al. 2022), remains the most reliable option for depth-specific water collection (Govindarajan et al. 2021). A detailed summary of these trade-offs is provided in Table 4.
Therefore, the selection of a sampling method should be guided by the research objectives, logistical constraints, and the required depth resolution. Careful consideration of these factors will help ensure effective and efficient application of eDNA metabarcoding for monitoring marine surface fish communities. Since each method can broadly capture and distinguish distinct fish community structures on its own, integrating data from all methods offers a more comprehensive approach to biodiversity assessments in marine surface environments.
5 Conclusion
This study demonstrates the effectiveness of three surface sampling methods—bucket, intake, and Niskin—for assessing fish biodiversity and community composition using eDNA metabarcoding. Each method reliably captured a broad range of species and community structures, offering complementary insights into the complex dynamics of marine surface ecosystems. While minor differences between methods exist, understanding these differences allows for more informed interpretation of eDNA data. Although the combined use of all three methods provides a more comprehensive perspective, it is often impractical in large-scale ocean surveys due to logistical constraints. Therefore, in situations where all three methods cannot be utilized, any one of them can be prioritized based on resource availability and ease of sampling.
Data Availability
Raw read data have been submitted to NCBI (BioProject: PRJNA1241902 and PRJNA1262646). These large data have been used for several studies, including the analysis of species-specific detection performance across sampling methods in the surface layer (Ahmed et al., 2025, submitted), biogeographic boundaries at the Takara Gap (Inoue et al., 2025, submitted), and fish species composition, including subsurface layers (Lin et al., 2025, submitted). All relevant data are provided within the manuscript and its supplementary materials, available in four DOCX files titled Supplementary Material 1, Supplementary Material 2, Supplementary Material 3, and Supplementary Material 4, as well as one Excel file named Supplementary Material 5. The scripts used for data analysis and figure generation in this study are available at https://github.com/iahmedBD2020/Ahmed-et-al.-2025-JO-.
References
Afzali SF, Bourdages H, Laporte M, Mérot C, Normandeau E, Audet C, Bernatchez L (2021) Comparing environmental metabarcoding and trawling survey of demersal fish communities in the Gulf of St. Lawrence, Canada. Environ DNA 3:22–42. https://doi.org/10.1002/edn3.111
Ahmed SI, Yu Z, Higuchi T, Inoue J, Wong MK-S, Yoshizawa S, Itoh S, Komatsu K, Kawaguchi Y, Tsutsumi E, Lin Y, Hyodo S, Morrongiello J, Ito S (2025, submitted) Fish species detection by environmental DNA metabarcoding by three surface sampling methods. Reg Stud Mar Sci
Ahn H, Asano Y, Suzuki S, Ooki A (2022) Positive effect of filtration additives for increasing environmental DNA detections in summer and winter oceanic waters. Environ DNA 4:661–673. https://doi.org/10.1002/edn3.282
Alexander JB, Marnane MJ, McDonald JI, Lukehurst SS, Elsdon TS, Simpson T, Hinz S, Bunce M, Harvey ES (2023) Comparing environmental DNA collection methods for sampling community composition on marine infrastructure. Estuar Coast Shelf Sci 283:108283. https://doi.org/10.1016/j.ecss.2023.108283
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410. https://doi.org/10.1016/S0022-2836(05)80360-2
Andruszkiewicz E, Sassoubre L, Boehm A (2017) Persistence of marine fish environmental DNA and the influence of sunlight. PLoS ONE 12:e0185043. https://doi.org/10.1371/journal.pone.0185043
Andruszkiewicz E, Koseff J, Fringer O, Ouellette N, Lowe A, Edwards C, Boehm A (2019) Modeling environmental DNA transport in the coastal ocean using Lagrangian particle tracking. Front Mar Sci. https://doi.org/10.3389/fmars.2019.00477
Ashjian C, Arnone R, Davis C, Jones B, Kahru M, Lee C, Mitchell B (2006) Biological structure and seasonality in the Japan/East Sea. Oceanography 19:122–133. https://doi.org/10.5670/OCEANOG.2006.49
Bakun A (2006) Fronts and eddies as key structures in the habitat of marine fish larvae: opportunity, adaptive response, and competitive advantage. Sci Mar 70(2):105–122. https://doi.org/10.3989/SCIMAR.2006.70S2105
Barnes M, Turner C (2016) The ecology of environmental DNA and implications for conservation genetics. Conserv Genet 17:1–17. https://doi.org/10.1007/s10592-015-0775-4
Boettiger C, Lang DT, Wainwright PC (2012) rfishbase: exploring, manipulating, and visualizing FishBase data from R. J Fish Biol 81:2030–2039. https://doi.org/10.1111/j.1095-8649.2012.03464.x
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA et al (2019) Reproducible, interactive, scalable, and extensible microbiome data science using QIIME 2. Nat Biotechnol 37(8):852–857. https://doi.org/10.1038/s41587-019-0209-9
Brandt M, Trouche B, Quintric L, Günther B, Wincker P, Poulain J, Arnaud-Haond S (2021) Bioinformatic pipelines combining denoising and clustering tools allow for more comprehensive prokaryotic and eukaryotic metabarcoding. Mol Ecol Resour 21:1422–1436. https://doi.org/10.1111/1755-0998.13398
Cantera I, Cilleros K, Valentini A, Cerdan A, Dejean T, Iribar A, Taberlet P, Vigouroux R, Brosse S (2019) Optimizing environmental DNA sampling effort for fish inventories in tropical streams and rivers. Sci Rep 9:3085. https://doi.org/10.1038/s41598-019-39399-5
Claudet J, Fraschetti S (2010) Human-driven impacts on marine habitats: a regional meta-analysis in the Mediterranean Sea. Biol Conserv 143:2195–2206. https://doi.org/10.1016/j.biocon.2010.06.004
Collette BB, Nauen CE (1983) FAO species catalogue. Vol. 2. Scombrids of the world. An annotated and illustrated catalogue of tunas, mackerels, bonitos and other related species known to date. C 137
Collins RA, Wangensteen OS, O’Gorman EJ, Mariani S, Sims DW, Genner MJ (2018) Persistence of environmental DNA in marine systems. Commun Biol 1:185. https://doi.org/10.1038/s42003-018-0192-6
Cunliffe M, Engel A, Frka S, Gašparović B, Guitart C, Murrell J, Salter M, Stolle C, Upstill-Goddard R, Wurl O (2013) Sea surface microlayers: a unified physicochemical and biological perspective of the air–ocean interface. Prog Oceanogr 109:104–116. https://doi.org/10.1016/J.POCEAN.2012.08.004
De Cáceres M, Legendre P, Moretti M (2010) Improving indicator species analysis. Ecol Monogr 80(3):347–367. https://doi.org/10.1111/j.1600-0706.2010.18334.x
Drake LE, Cuff JP, Young RE, Marchbank A, Chadwick EA, Symondson WOC (2022) An assessment of minimum sequence copy thresholds for identifying and reducing the prevalence of artefacts in dietary metabarcoding data. Methods Ecol Evol 13:694–710. https://doi.org/10.1111/2041-210X.1378
Dumoulin LA, Chevrinais M, St-Louis R, Parent GJ (2024) Metal ions limit or enhance environmental DNA detectability in marine sediments. Environ DNA 6:e568. https://doi.org/10.1002/edn3.568
Engel A, Galgani L (2016) The organic sea-surface microlayer in the upwelling region off the coast of Peru and potential implications for air–sea exchange processes. Biogeosci 13:989–1007. https://doi.org/10.5194/BG-13-989-2016
Evans NT, Shirey PD, Wieringa JG, Mahon AR, Lamberti GA (2017) Comparative cost and effort of fish distribution detection via environmental DNA analysis and electrofishing. Fisheries 42:90–99. https://doi.org/10.1080/03632415.2017.1276329
Fonseca VG (2018) Pitfalls in relative abundance estimation using eDNA metabarcoding. Mol Ecol Resour 18:923–926. https://doi.org/10.1111/1755-0998.12902
Fraija-Fernández N, Bouquieaux M, Rey A, Mendibil I, Cotano U, Irigoien X, Santos M, Rodríguez-Ezpeleta N (2020) Marine water environmental DNA metabarcoding provides a comprehensive fish diversity assessment and reveals spatial patterns in a large oceanic area. Ecol Evol 10:7560–7584. https://doi.org/10.1002/ece3.6482
Froese R, Pauly D (Eds.) (2024) FishBase. Retrieved October 10, 2024, from https://www.fishbase.org
Fujii K, Doi H, Matsuoka S, Nagano M, Sato H, Yamanaka H (2019) Environmental DNA metabarcoding for fish community analysis in backwater lakes: a comparison of capture methods. PLoS ONE 14:e0210357. https://doi.org/10.1371/journal.pone.0210357
Fujikura K, Lindsay D, Kitazato H, Nishida S, Shirayama Y (2010) Marine biodiversity in Japanese waters. PLoS ONE 5:e11836. https://doi.org/10.1371/journal.pone.0011836
Gamfeldt L, Lefcheck J, Byrnes J, Cardinale B, Duffy J, Griffin J (2015) Marine biodiversity and ecosystem functioning: What’s known and what’s next? Oikos 124:252–265. https://doi.org/10.1111/oik.01549
Giske J, Huse G, Fiksen Ø (1998) Modelling spatial dynamics of fish. Rev Fish Biol Fish 8:57–91. https://doi.org/10.1023/A:1008864517488
Gold Z, Sprague J, Kushner D, Zerecero E, Barber P (2021) eDNA metabarcoding as a biomonitoring tool for marine protected areas. PLoS ONE 16:e0238557. https://doi.org/10.1371/journal.pone.0238557
Govindarajan A, Francolini R, Jech J, Lavery A, Llopiz J, Wiebe P, Zhang W (2021) Exploring the use of environmental DNA (eDNA) to detect animal taxa in the mesopelagic zone. Front Ecol Evol 9:574877. https://doi.org/10.3389/fevo.2021.574877
Greco M, Lejzerowicz F, Reo E, Caruso A, Maccotta A, Coccioni R, Pawlowski J, Frontalini F (2022) Environmental RNA outperforms eDNA metabarcoding in assessing impact of marine pollution: a chromium-spiked mesocosm test. Chemosphere 298:134239. https://doi.org/10.1016/j.chemosphere.2022.134239
Guo X, Miyazawa Y, Yamagata T (2006) The Kuroshio onshore intrusion along the shelf break of the East China Sea: the origin of the Tsushima warm current. J Phys Oceanogr 36:2205–2231. https://doi.org/10.1175/JPO2976.1
Hayami K, Sakata MK, Inagawa T, Okitsu J, Katano I, Doi H, Nakai K, Ichiyanagi H, Gotoh RO, Miya M, Sato H, Yamanaka H, Minamoto T (2020) Effects of sampling seasons and locations on fish environmental DNA metabarcoding in dam reservoirs. Ecol Evol 10:3030–3042. https://doi.org/10.1002/ece3.6279
He X, Jeffery N, Stanley R, Hamilton L, Rubidge E, Abbott C (2023) eDNA metabarcoding enriches traditional trawl survey data for monitoring biodiversity in the marine environment. ICES J Mar Sci 80:1529–1538. https://doi.org/10.1093/icesjms/fsad083
Higgs ND, Attrill MJ (2015) Biases in biodiversity: Wide-ranging species are discovered first in the deep sea. Front Mar Sci 2:61. https://doi.org/10.3389/fmars.2015.00061
Hoshina Y, Tohjima Y, Katsumata K, Machida T, Nakaoka S (2018) In situ observation of atmospheric oxygen and carbon dioxide in the North Pacific using a cargo ship. Atmos Chem Phys 18:9283–9292. https://doi.org/10.5194/ACP-18-9283-2018
Hsieh TC, Ma KH, Chao A (2016) iNEXT: An R package for interpolation and extrapolation of species diversity (Hill numbers). Methods Ecol Evol 7:1451–1456. https://doi.org/10.1111/2041-210X.12613
Inoue J, Hirai J, Ikeba K, Yu Z, Ahmed SI, Lin Z, Lin Y, Wong MK-S, Shinzato C, Itoh S, Ito S, Saito H, Hyodo S (2025, submitted) Ocean eDNA Map: a metabarcoding web tool for comparing marine biodiversity with special reference to teleost fish. Mol Ecol Resour
Inui R, Akamatsu Y, Kono T, Saito M, Miyazono S, Nakao R (2021) Spatiotemporal changes of the environmental DNA concentrations of amphidromous fish Plecoglossus altivelis altivelis in the spawning grounds in the Takatsu River, Western Japan. Front Ecol Evol 9:622149. https://doi.org/10.3389/fevo.2021.622149
Ishak N, Tadokoro K, Okazaki Y, Kakehi S, Suyama S, Takahashi K (2020) Distribution, biomass, and species composition of salps and doliolids in the Oyashio-Kuroshio transitional region: potential impact of massive bloom on the pelagic food web. J Oceanogr 76:351–363. https://doi.org/10.1007/s10872-020-00549-3
Jac C, Desroy N, Duchêne JC, Foveau A, Labrune C, Lescure L, Vaz S (2021) Assessing the impact of trawling on benthic megafauna: comparative study of video surveys vs. scientific trawling. ICES J Mar Sci 78:1636–1649. https://doi.org/10.1093/icesjms/fsab033
Jackman JM, Benvenuto C, Coscia I, Carvalho CO, Ready JS, Boubli JP, Magnusson WE, McDevitt AD, Sales NG (2021) eDNA in a bottleneck: Obstacles to fish metabarcoding studies in megadiverse freshwater systems. Environ DNA 3:837–849. https://doi.org/10.1002/edn3.191
Jackson J, Kirby M, Berger W, Bjorndal K, Botsford L, Bourque B, Bradbury R, Cooke R, Erlandson J, Estes J, Hughes T, Kidwell S, Lange C, Lenihan H, Pandolfi J, Peterson C, Steneck R, Tegner M, Warner R (2001) Historical overfishing and the recent collapse of coastal ecosystems. Science 293:629–637. https://doi.org/10.1126/science.1059199
Jeunen G, Knapp M, Spencer H, Lamare M, Taylor H, Stat M, Bunce M, Gemmell N (2019) Environmental DNA (eDNA) metabarcoding reveals strong discrimination among diverse marine habitats connected by water movement. Mol Ecol Resour 19:426–438. https://doi.org/10.1111/1755-0998.12982
Jo T, Minamoto T (2020) Complex interactions between environmental DNA (eDNA) state and water chemistries on eDNA persistence suggested by meta-analyses. Mol Ecol Resour 21:1234–1245. https://doi.org/10.1111/1755-0998.13354
Jo T, Murakami H, Yamamoto S, Masuda R, Minamoto T (2019) Effect of water temperature and fish biomass on environmental DNA shedding, degradation, and size distribution. Ecol Evol 9:1135–1146. https://doi.org/10.1002/ece3.4802
Jo T, Murakami H, Masuda R, Minamoto T (2020) Selective collection of long fragments of environmental DNA using larger pore size filter. Sci Total Environ 735:139462. https://doi.org/10.1016/j.scitotenv.2020.139462
Kara AB, Rochford PA, Hurlburt HE (2000) An optimal definition for ocean mixed layer depth. J Geophys Res 105(C7):16803–16821. https://doi.org/10.1029/2000JC900072
Kawakami T, Yamazaki A, Asami M, Goto Y, Yamanaka H, Hyodo S, Ueno H, Kasai A (2023) Evaluating the sampling effort for the metabarcoding-based detection of fish environmental DNA in the open ocean. Ecol Evol 13:e9921. https://doi.org/10.1002/ece3.9921
Kennedy SR, Prost S, Overcast I, Rominger AJ, Gillespie RG, Krehenwinkel H (2020) High-throughput sequencing for community analysis: the promise of DNA barcoding to uncover diversity, relatedness, abundances and interactions in spider communities. Dev Genes Evol 230:185–201. https://doi.org/10.1007/s00427-020-00652-x
Kida S, Mitsudera H, Aoki S, Guo X, Ito S, Kobashi F, Komori N, Kubokawa A, Miyama T, Morie R, Nakamura H, Nakamura T, Nakano H, Nishigaki H, Nonaka M, Sasaki H, Sasaki YN, Suga T, Sugimoto S, Taguchi B, Takaya K, Tozuka T, Tsujino H, Usui N (2015) Oceanic fronts and jets around Japan: a review. J Oceanogr 71(5):469–497. https://doi.org/10.1007/s10872-015-0283-7
Kolde R (2019) pheatmap: Pretty heatmaps. R package version 1.0.12. https://CRAN.R-project.org/package=pheatmap
Lamb PD, Hunter E, Pinnegar JK, Creer S, Davies RG, Taylor MI (2019) How quantitative is metabarcoding: a meta-analytical approach. Mol Ecol 28(2):420–430. https://doi.org/10.1111/mec.14920
Li J, Handley L, Read D, Hänfling B (2018) The effect of filtration method on the efficiency of environmental DNA capture and quantification via metabarcoding. Mol Ecol Resour 18:1102–1114. https://doi.org/10.1111/1755-0998.12899
Li C, Long H, Yang S, Zhang Y, Tang F, Jin W, Wang G, Chang W, Pi Y, Gao L, Ma L, Zhao M, Zheng H, Gong Y, Liu Y, Jiang K (2022) eDNA assessment of pelagic fish diversity, distribution, and abundance in the central Pacific Ocean. Reg Stud Mar Sci 56:102661. https://doi.org/10.1016/j.rsma.2022.102661
Lin Y, Yu Z, Ahmed SI, Wang X, Higuchi T, Yabe I, Wong MK-S, Tsutsumi E, Itoh S, Saito H, Komatsu K, Tsuda A, Kawaguchi Y, Oka E, Obata H, Minegishi Y, Fukuda H, Inoue J, Hyodo S, Ito S (2025, submitted) Opposite latitudinal biodiversity gradient in the open ocean generated by transition zones: insights from fish distribution in the Northwest Pacific. Prog Oceanogr
Littlefair JE, Hrenchuk LE, Blanchfield PJ, Rennie MD, Cristescu ME (2020) Thermal stratification and fish thermal preference explain vertical eDNA distributions in lakes. Mol Ecol 32(6):1764–1776. https://doi.org/10.1111/mec.15623
Lowerre-Barbieri S, DeCelles G, Pepin P, Catalán I, Muhling B, Erisman B, Cadrin S, Alós J, Ospina-Alvarez A, Stachura M, Tringali M, Burnsed S, Paris-Limouzy C (2017) Reproductive resilience: a paradigm shift in understanding spawner-recruit systems in exploited marine fish. Fish Fish 18:285–312. https://doi.org/10.1111/FAF.12180
Maechler M, Rousseeuw P, Struyf A, Hubert M, Hornik K (2023) cluster: Cluster analysis basics and extensions. R package version 2.1.4. https://CRAN.R-project.org/package=cluster
Mari X, Passow U, Migon C, Burd A, Legendre L (2017) Transparent exopolymer particles: effects on carbon cycling in the ocean. Prog Oceanogr 151:13–37. https://doi.org/10.1016/J.POCEAN.2016.11.002
Martin A, Boyd P, Buesseler K, Cetinic I, Claustre H, Giering S, Henson S, Irigoien X, Kriest I, Memery L, Robinson C, Saba G, Sanders R, Siegel D, Villa-Alfageme M, Guidi L (2020) The oceans’ twilight zone must be studied now, before it is too late. Nature 580:26–28. https://doi.org/10.1038/d41586-020-00915-7
Miller P, Christodoulou S (2014) Frequent locations of oceanic fronts as an indicator of pelagic diversity: application to marine protected areas and renewables. Mar Policy 45:318–329. https://doi.org/10.1016/j.marpol.2013.09.009
Miloslavich P, Bax N, Simmons S, Klein E, Appeltans W, Aburto-Oropeza O, Garcia M, Batten S, Benedetti-Cecchi L, Checkley D, Chiba S, Duffy J, Dunn D, Fischer A, Gunn J, Kudela R, Marsac F, Muller-Karger F, Obura D, Shin Y (2018) Essential ocean variables for global sustained observations of biodiversity and ecosystem changes. Glob Change Biol 24:2416–2433. https://doi.org/10.1111/gcb.14108
Miya M (2022) Environmental DNA metabarcoding: a novel method for biodiversity monitoring of marine fish communities. Annu Rev Mar Sci 14:161–185. https://doi.org/10.1146/annurev-marine-041421-082251
Miya M, Sato Y, Fukunaga T, Sado T, Poulsen JY, Sato K, Minamoto T, Yamamoto S, Yamanaka H, Araki H, Kondoh M, Iwasaki W (2015) MiFish, a set of universal PCR primers for metabarcoding environmental DNA from fishes: detection of more than 230 subtropical marine species. R Soc Open Sci 2:150088. https://doi.org/10.1098/rsos.150088
Miya M, Gotoh RO, Sado T (2020) MiFish metabarcoding: a high-throughput approach for simultaneous detection of multiple fish species from environmental DNA and other samples. Fish Sci 86:939–970. https://doi.org/10.1007/s12562-020-01461-x
Miyata K, Inoue Y, Amano Y, Nishioka T, Nagaike T, Kawaguchi T, Morita O, Yamane M, Honda H (2022) Comparative environmental RNA and DNA metabarcoding analysis of river algae and arthropods for ecological surveys and water quality assessment. Sci Rep 12:19828. https://doi.org/10.1038/s41598-022-23888-1
Montenegro D, Astudillo-García C, Hickey T, Lear G (2020) A non-invasive method to monitor marine pollution from bacterial DNA present in fish skin mucus. Environ Pollut 263:114438. https://doi.org/10.1016/j.envpol.2020.114438
Montoto-Martínez T, Hernández-Brito J, Gelado-Caballero M (2020) Pump-underway ship intake: an unexploited opportunity for Marine Strategy Framework Directive (MSFD) microplastic monitoring needs on coastal and oceanic waters. PLoS ONE. https://doi.org/10.1371/journal.pone.0232744
Nelson JS, Grande TC, Wilson MVH (2016) Fishes of the world, 5th edn. Wiley
Met Office (2010–2023) Cartopy: a cartographic python library with a Matplotlib interface. https://scitools.org.uk/cartopy/docs/latest/
Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Szoecs E, Wagner H (2022) vegan: Community Ecology Package. R package version 2.6-4. https://cran.r-project.org/web/packages/vegan/index.html
Orlov A, Tokranov A (2019) Checklist of deep-sea fishes of the Russian northwestern Pacific Ocean found at depths below 1000 m. Prog Oceanogr 176:102143. https://doi.org/10.1016/j.pocean.2019.102143
Pastor Rollan A, Sherman C, Ellis M, Tuohey K, Vennell R, Foster-Thorpe C, Treml E (2024) Current trends in biophysical modeling of eDNA dynamics for the detection of marine species. Environ DNA. https://doi.org/10.1002/edn3.70021
Poloczanska E, Brown C, Sydeman W, Kiessling W, Schoeman D, Moore P, Brander K, Bruno J, Buckley L, Burrows M, Duarte C, Halpern B, Holding J, Kappel C, O’Connor M, Pandolfi J, Parmesan C, Schwing F, Thompson S, Richardson A (2013) Global imprint of climate change on marine life. Nat Clim Change 3:919–925. https://doi.org/10.1038/nclimate1958
Pompanon F, Deagle BE, Symondson WO, Brown DS, Jarman SN, Taberlet P (2012) Who is eating what: diet assessment using next generation sequencing. Mol Ecol 21(8):1931–1950. https://doi.org/10.1111/j.1365-294X.2011.05403.x
Python Software Foundation (2023) Python language reference, version 3.10.9. https://www.python.org
R Core Team (2023) R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Roblet S, Priouzeau F, Gambini G, Dérijard B, Sabourault C (2024) Primer set evaluation and sampling method assessment for the monitoring of fish communities in the North-western part of the Mediterranean Sea through eDNA metabarcoding. Environ DNA. https://doi.org/10.1002/edn3.554
Różański R, Trenkel V, Lorance P, Valentini A, Dejean T, Pellissier L, Eme D, Albouy C (2022) Disentangling the components of coastal fish biodiversity in southern Brittany by applying an environmental DNA approach. Environ DNA 4(4):920–939. https://doi.org/10.1002/edn3.305
Saito H (2019) The Kuroshio: its recognition, scientific activities and emerging issues. In: Nagai T, Saito H, Suzuki K, Takahashi M (eds) Kuroshio current: physical, biogeochemical, and ecosystem dynamics. AGU-Wiley
Sakata M, Watanabe T, Maki N, Ikeda K, Kosuge T, Okada H, Yamanaka H, Sado T, Miya M, Minamoto T (2021) Determining an effective sampling method for eDNA metabarcoding: a case study for fish biodiversity monitoring in a small, natural river. Limnology 22:221–235. https://doi.org/10.1007/s10201-020-00645-9
Sandoval Gallardo S, Fossile T, Herbst DF, Begossi A, Silva LG, Colonese AC (2021) 150 years of anthropogenic impact on coastal and ocean ecosystems in Brazil revealed by historical newspapers. Ocean Coast Manag 209:105662. https://doi.org/10.1016/j.ocecoaman.2021.105662
Sassoubre L, Yamahara K, Gardner L, Block B, Boehm A (2016) Quantification of environmental DNA (eDNA) shedding and decay rates for three marine fish. Environ Sci Technol 50(19):10456–10464. https://doi.org/10.1021/ACS.EST.6B03114
Sato S, Nakashima A, Yamaya K, Urawa S (2018a) Geographical origins of juvenile chum salmon migrating along the Pacific coast of Hokkaido, Japan, during early summer. NPAFC Tech Rep. 11:24–26. https://doi.org/10.23849/npafctr11/24.26
Sato Y, Miya M, Fukunaga T, Sado T, Iwasaki W (2018b) MitoFish and MiFish pipeline: a mitochondrial genome database of fish with an analysis pipeline for environmental DNA metabarcoding. Mol Biol Evol 35(6):1553–1555. https://doi.org/10.1093/molbev/msy074
Sato M, Yamamoto S, Sado T, Minamoto T (2021) Quantitative assessment of multiple fish species around artificial reefs combining environmental DNA metabarcoding and acoustic survey. Sci Rep 11(1):2753. https://doi.org/10.1038/s41598-021-98926-5
Schenekar T, Schletterer M, Lecaudey L, Weiss S (2020) Reference databases, primer choice, and assay sensitivity for environmental metabarcoding: Lessons learnt from a re-evaluation of an eDNA fish assessment in the Volga headwaters. River Res Appl 36:1004–1013. https://doi.org/10.1002/rra.3610
Schmelzle M, Kinziger A (2016) Using occupancy modelling to compare environmental DNA to traditional field methods for regional-scale monitoring of an endangered aquatic species. Mol Ecol Resour 16:895–908. https://doi.org/10.1111/1755-0998.12501
Sepulveda AJ, Hutchins PR, Forstchen M, McKeefry MN, Swigris AM (2020) The elephant in the lab (and field): contamination in aquatic environmental DNA studies. Front Ecol Evol 8:609973. https://doi.org/10.3389/fevo.2020.609973
Skelton J, Cauvin A, Hunter ME (2023) Environmental DNA metabarcoding read numbers and their variability predict species abundance, but weakly in non-dominant species. Environ DNA 5:1092–1104. https://doi.org/10.1002/edn3.355
Stat M, Huggett MJ, Bernasconi R et al (2017) Ecosystem biomonitoring with eDNA: Metabarcoding across the tree of life in a tropical marine environment. Sci Rep 7:12240. https://doi.org/10.1038/s41598-017-12501-5
Stat M, John J, DiBattista J, Newman S, Bunce M, Harvey E (2019) Combined use of eDNA metabarcoding and video surveillance for the assessment of fish biodiversity. Conserv Biol 33:196–205. https://doi.org/10.1111/cobi.13183
Stauffer S, Jucker M, Keggin T, Marques V, Andrello M, Bessudo S, Cheutin MC, Borrero-Pérez GH, Richards E, Dejean T, Hocdé R, Juhel JB, Ladino F, Letessier TB, Loiseau N, Maire E, Mouillot D, Mutis Martinezguerra M, Manel S, Waldock C (2021) How many replicates to accurately estimate fish biodiversity using environmental DNA on coral reefs? Ecol Evol 11:14630–14643. https://doi.org/10.1002/ece3.8150
Stehfest KM, Carter CG, McAllister JD, Ross JD, Semmens JM (2017) Response of Atlantic salmon Salmo salar to temperature and dissolved oxygen extremes established using animal-borne environmental sensors. Sci Rep 7:4545. https://doi.org/10.1038/s41598-017-04806-2
Stoeckle MY, Adolf J, Charlop-Powers Z, Dunton KJ, Hinks G, VanMorter SM (2021) Trawl and eDNA assessment of marine fish diversity, seasonality, and relative abundance in coastal New Jersey, USA. ICES J Mar Sci 78(1):293–304. https://doi.org/10.1093/icesjms/fsaa225
Stramski D, Reynolds R, Gernez P, Röttgers R, Wurl O (2019) Inherent optical properties and particle characteristics of the sea-surface microlayer. Prog Oceanogr 169:13–33. https://doi.org/10.1016/J.POCEAN.2019.05.009
Strickler KM, Fremier AK, Goldberg CS (2015) Quantifying effects of UV-B, temperature, and pH on eDNA degradation in aquatic microcosms. Biol Conserv 183:85–92. https://doi.org/10.1016/j.biocon.2014.11.038
Suzuki J, Nakano D, Kobayashi S (2022) Characteristics of diurnal and seasonal changes in fish detection patterns using environmental DNA metabarcoding in a mountain stream. Limnologica 93:125955. https://doi.org/10.1016/j.limno.2022.125955
Takahara T, Minamoto T, Yamanaka H, Doi H, Kawabata Z (2019) Using environmental DNA to estimate the seasonal distribution and habitat preferences of a Japanese basket clam in Lake Shinji, Japan. Estuar Coast Shelf Sci 219:230–239. https://doi.org/10.1016/j.ecss.2019.02.022
Thomsen P, Kielgast J, Iversen L, Møller P, Rasmussen M, Willerslev E (2012) Detection of a diverse marine fish fauna using environmental DNA from seawater samples. PLoS ONE 7:e41732. https://doi.org/10.1371/journal.pone.0041732
Thornton DCO (2014) Dissolved organic matter (DOM) release by phytoplankton in the contemporary and future ocean. Eur J Phycol 49(1):20–46. https://doi.org/10.1080/09670262.2013.875596
Trusenkova O, Ishida H (2005) Seasonal variation of surface and deep currents in the Japan Sea. Doboku Gakkai Ronbunshu 2005:93–111. https://doi.org/10.2208/JSCEJ.2005.796_93
Vipin PM, Ravi R, Pradeep K, Fernandez TJ, Madhu VR, Remesan MP, Boopendranath MR (2015) Length–weight relationship of Myctophum spinosum (Steindachner, 1867) caught off the south-west coast of India. Fish Technol 52:187–190
Watanabe H, Kawaguchi K, Hayashi A (2002) Feeding habits of juvenile surface-migratory myctophid fishes (family Myctophidae) in the Kuroshio region of the western North Pacific. Mar Ecol Prog Ser 236:263–272. https://doi.org/10.3354/meps236263
Watari S, Murase H, Yonezaki S, Okazaki M, Kiyofuji H, Tamura T, Hakamada T, Kanaji Y, Kitakado T (2019) Ecosystem modeling in the western North Pacific using Ecopath, with a focus on small pelagic fishes. Mar Ecol Prog Ser. https://doi.org/10.3354/MEPS12508
West KM, Stat M, Harvey ES, Skepper CL, DiBattista JD, Richards ZT, Travers MJ, Newman SJ, Bunce M (2020) eDNA metabarcoding survey reveals fine-scale coral reef community variation across a remote, tropical island ecosystem. Mol Ecol 29(6):1069–1086. https://doi.org/10.1111/mec.15382
Wickham H (2016) ggplot2: Elegant graphics for data analysis. Springer, New York. https://ggplot2.tidyverse.org
Wong MK, Nakao M, Hyodo S (2020) Field application of an improved protocol for environmental DNA extraction, purification, and measurement using Sterivex filter. Sci Rep 10:21531. https://doi.org/10.1038/s41598-020-77304-7
Woodson C, Litvin S (2015) Ocean fronts drive marine fishery production and biogeochemical cycling. Proc Natl Acad Sci 112:1710–1715. https://doi.org/10.1073/pnas.1417143112
Wurl O, Holmes M (2008) The gelatinous nature of the sea-surface microlayer. Mar Chem 110(1–2):89–97. https://doi.org/10.1016/j.marchem.2008.02.009
Wurl O, Miller L, Röttgers R, Vagle S (2009) The distribution and fate of surface-active substances in the sea-surface microlayer and water column. Mar Chem 115:1–9. https://doi.org/10.1016/J.MARCHEM.2009.04.007
Wurl O, Miller L, Vagle S (2011) Production and fate of transparent exopolymer particles in the ocean. J Geophys Res Oceans. https://doi.org/10.1029/2011JC007342
Xing Q, Yu H, Wang H, Ito S, Chai F (2023) Mesoscale eddies modulate the dynamics of human fishing activities in the global midlatitude ocean. Fish Fish 24(4):527–543. https://doi.org/10.1111/faf.12742
Yamamoto S, Masuda R, Sato Y, Sado T, Araki H, Kondoh M, Minamoto T, Miya M (2017) Environmental DNA metabarcoding reveals local fish communities in a species-rich coastal sea. Sci Rep 7:40368. https://doi.org/10.1038/srep40368
Yasuda I (2003) Hydrographic structure and variability in the Kuroshio-Oyashio transition area. J Oceanogr 59:389–402. https://doi.org/10.1023/A:1025580313836
Yoo JT, Nakata H, Sugimoto T (2004) Effect of horizontal advection induced by intrusion of the Kuroshio water on plankton biomass in the spring fishing period of shirasu on the Pacific coast of Japan. Fish Sci 70(6):937–944. https://doi.org/10.1111/j.1444-2906.2004.00892.x
Yoshida T, Kawato M, Fujiwara Y, Nagano Y, Tsuchida S, Yabuki A (2023) Optimization of environmental DNA analysis using pumped deep-sea water for the monitoring of fish biodiversity. Front Mar Sci 9:965800. https://doi.org/10.3389/fmars.2022.965800
Yu Z, Ito S, Wong MK, Yoshizawa S, Inoue J, Itoh S, Yukami R, Ishikawa K, Guo C, Ijichi M, Hyodo S (2022) Comparison of species-specific qPCR and metabarcoding methods to detect small pelagic fish distribution from open ocean environmental DNA. PLoS ONE 17:e0273670. https://doi.org/10.1371/journal.pone.0273670
Yu Z, Wong MK, Inoue J, Ahmed SI, Higuchi T, Hyodo S, Itoh S, Komatsu K, Saito H, Ito S (2023) Environmental DNA in the Kuroshio reveals environment-dependent distribution of economically important small pelagic fish. Front Mar Sci 10:1121088. https://doi.org/10.3389/fmars.2023.1121088
Zamanillo M, Ortega-Retuerta E, Nunes S, Estrada M, Sala M, Royer S, López-Sandoval D, Emelianov M, Vaqué D, Marrasé C, Simó R (2019) Distribution of transparent exopolymer particles (TEP) in distinct regions of the Southern Ocean. Sci Total Environ 691:736–748. https://doi.org/10.1016/j.scitotenv.2019.06.524
Zhang H, Yoshizawa S, Iwasaki W, Xian W (2019) Seasonal fish assemblage structure using environmental DNA in the Yangtze Estuary and its adjacent waters. Front Mar Sci 6:671. https://doi.org/10.3389/fmars.2019.00515
Zhang Y, Pavlovska M, Stoica E, Prekrasna I, Yang J, Slobodnik J et al (2020) Holistic pelagic biodiversity monitoring of the Black Sea via eDNA metabarcoding approach: from bacteria to marine mammals. Environ Int 135:105307. https://doi.org/10.1016/j.envint.2019.105307
Zou K, Chen J, Ruan H, Li Z, Guo W, Li M, Liu L (2020) eDNA metabarcoding as a promising conservation tool for monitoring fish diversity in a coastal wetland of the Pearl River Estuary compared to bottom trawling. Sci Total Environ 702:134704. https://doi.org/10.1016/j.scitotenv.2019.134704
Acknowledgements
The authors would like to express their sincere gratitude to the crew members and captains of the R/V Shinsei-maru and R/V Hakuho Maru for their invaluable assistance in sample collection during all nine research cruises. We also extend our appreciation to the researchers who contributed to the sampling efforts during these voyages. In particular, we would like to extend our special thanks to Prof. Koji Hamasaki, the chief scientist of the KH-23-3 cruise aboard R/V Hakuho Maru, for his exceptional cooperation and support in coordinating smooth sampling operations and facilitating experimental procedures. The cruises were supported by the Cooperative Research Program of the Atmosphere and Ocean Research Institute, The University of Tokyo (Research Vessels Hakuho-maru, JURCAOSSH23-17, and Shinsei-Maru, JURCAOSS22-21, JURCAOSS21-28, JURCAOSS21-07, JURCAOSS21-32, JURCAOSS20-31, JURCAOSS22-26, and JURCAOSS21-34). In addition, we would also like to express our special thanks to the editor and the three anonymous reviewers for their constructive comments and valuable suggestions, which greatly improved the quality of this manuscript. The first author wishes to acknowledge the Ministry of Education, Culture, Sports, Science and Technology of Japan for their financial support throughout his doctoral studies. Furthermore, the first author is deeply grateful to Dr. Yabe Itsuka for her invaluable guidance during his doctoral research.
Funding
Open Access funding provided by The University of Tokyo. This research was supported by: 1. SII, JP21H04735, JP22H05030, and JP25H02072, The Japan Society for the Promotion of Science (JSPS) KAKENHI, https://www.jsps.go.jp/english/. 2. SII and SH, The OceanDNA project, The University of Tokyo Future Society Initiative, https://www.u-tokyo.ac.jp/adm/fsi/ja/projects/sdgs/projects_00103.html. 3. SI, ET and SII, Integrative research on transition zones between coastal and offshore waters for resource reserve and sustainable use, The University of Tokyo Future Society Initiative, https://www.u-tokyo.ac.jp/adm/fsi/en/projects/sdgs/projects_00176.html. 4. JM and SI, Open application for joint usage and cooperative research at Kashiwa, Atmosphere and Ocean Research Institute, The University of Tokyo, https://www.aori.u-tokyo.ac.jp/english/coop/index.html. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
Conceptualization: Sk Istiaque Ahmed; methodology: Sk Istiaque Ahmed, Zeshu Yu, Marty Kwok-Shing Wong, Tomihiko Higuchi, Xueding Wang, Yuan Lin, Jun Inoue; formal analysis and investigation: Sk Istiaque Ahmed; writing—original draft preparation: Sk Istiaque Ahmed; writing—review and editing: Shin-ichi Ito, John Morrongiello, El Mahdi Bendif, Sachihiko Itoh, Kosei Komatsu, Eisuke Tsutsumi, Marty Kwok-Shing Wong, Tomihiko Higuchi, Jun Inoue, Susumu Hyodo, Hideki Fukuda; funding acquisition, resources and supervision: Shin-ichi Ito.
Corresponding authors
Ethics declarations
Ethical approval
This manuscript has not been published or presented elsewhere in part or in entirety and is not under consideration by another journal.
Conflict of interest
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ahmed, S.I., Yu, Z., Higuchi, T. et al. Patterns in marine surface fish biodiversity and community composition detected by different eDNA metabarcoding sampling methods. J Oceanogr (2025). https://doi.org/10.1007/s10872-025-00771-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10872-025-00771-x