The open-source "Echostack" for scalable, cloud-native processing of water column sonar data
Water column sonar data collected by echosounders are essential for fisheries and marine ecosystem research, enabling the detection, classification, and quantification of fish and zooplankton from many different ocean observing platforms. However, the broad usage of these data has been hindered by the lack of modular software tools that allow flexible composition of data processing workflows that incorporate powerful analytical tools in the scientific Python ecosystem. We address this gap by developing Echostack, a suite of open-source Python software packages that leverage existing distributed computing and cloud-interfacing libraries to support intuitive and scalable data access, processing, and interpretation. These tools can be used individually or orchestrated together, which we demonstrate in example use cases for a fisheries acoustic-trawl survey.
Currently, the Echostack contains the following packages:
- Echopype: performs data standardization and computation from raw instrument files to acoustic data products
- Check out the Echopype paper in ICES Journal of Marine Science
- Echopop: generates acoustically derived biological estimates, such as abundance
- Learn more on Echopop project page
- Echoshader: enables interactive acoustic data visualization and exploration
- Echoregions: interfaces acoustic data with machine learning developments
- Echodataflow: workflow orchestration via text-based configuration “recipes” instead of code
These packages are accompanied by a set of data processing level definitions, Echolevels, which categorizes data products at different workflow stages to enhance data understanding and provenance tracking.
Check out Wu-Jung’s talk at SciPy 2024 and the associated proceeding paper!
Funding:
- NOAA Fisheries
- NOAA Office of Ocean Exploration and Research FY2021 grants