Skip to content

Latest commit

 

History

History
52 lines (28 loc) · 5.18 KB

README.md

File metadata and controls

52 lines (28 loc) · 5.18 KB

ITCOocean Hackweeks: Machine learning and ocean remote-sensing data

banner

Welcome

Welcome to Machine Learning based Species Distribution Modelling hosted by the International Training Centre for Operational Oceanography (ITCOocean), ESSO-INCOIS, Hyderabad, India and supported by NOAA Fisheries, NWFSC Mathematetical Biology Program via a NOAA International Fellowship.

The course and hackweek will take place at the ITCOocean training centre in Hyderabad, India from September 11-22, 2023. course announcement. The application period for the 2023 course has closed.

Course GitHub org: https://github.com/Hackweek-ITCOocean

Course Website: https://Hackweek-ITCOocean.github.io/2023-Hackbook

Overview

The the era of big data in the earth sciences is here and learning how to effectively use oceanographic remote-sensing data, both in the cloud and on your computer, is a core skill for modern fisheries science and management. Learning how to access cloud-based data, visualize these data, use these data in models, and use the tools of modern reproducible and collaborative science is the main goal of this course. Through the course, participants will gain experience with assessing remote-sensing data in the cloud, R and RStudio, and collaborating with Git and GitHub. Python users: Python is also very heavily used in big data geosciences. The course tutorials will focus on R, however you are welcome to use Python during the course if you are more comfortable with that.

Hackweek focus: species distribution modelling

In depth understanding of machine learning for species (macrofauna) distribution modelling (SDM) is needed by many young ecosystem researchers. SDMs are an important class of models used to understand species association with the environment. SDMs are widely used to understand how species distributions are changing with ocean climate change and are also used to estimate abundance over large ocean regions from fisheries survey data. This course is devised to familiarize the young professionals in the Indian Ocean-rim (IOR) countries with the latest developments in this field.

This course does not aim to teach ecology and habitat to the oceanographers/modellers/machine learning experts, rather it aims to teach machine learning and coding (and its relevance to fisheries) to fishery professionals.

Aims and Objectives

  • Learn how to discover and use oceanographic remote-sensing data for species distribution modeling and other fisheries applications
  • Familiarize participants with using remote-sensing data in R within models. Note participants who are more familiar with Python are welcome to use Python also.
  • Introduce the participants to machine learning models used in species distribution modelling.
  • Obtain hands-on experience in using species distribution modelling and machine-learning with remote-sensing data.
  • Work on a group project using species distribution modelling to estimate habitat associations or obtain a stock estimate using a species distribution model.

Note, there are many ways that participants can study aspects of species distribution using remote-sensing data. During the hackweek portion of the course, participants will jointly develop project to work on and in order to learn by diving into a specific project. Species location will not be specifically required in order to engage in the projects.

What to expect

A hackweek is a participant-driven workshop that blends data science education, community building, and project work over a short period of time (one to two weeks). The events are highly immersive and allow participants to work directly with data science professionals to co-shape projects and educational outcomes. Hackweeks often help individuals and teams engage more effectively in open and reproducible science. - eScience Institute, University of Washington

  • During the first week of the ITCOocean course, participants will work on learning how to access, visualize and manipulate remote-sensing data in R. Participants will develop familiarity with the R language, with RStudio, and with Git and GitHub for collaboration.
  • The course is an open science event: all tutorials and examples are developed openly and will be publicly available during and following the course. Participants will strengthen their practice of open science, using open source code and collaborating on their projects with course peers.

In the two to three weeks leading up to the course, participants are encouraged to review background resources that will facilitate a more effective experience.

Indian Ocean and Bay of Bengal Data

Students will learn how to access remote-sensing data in the cloud (meaning on-line), but we have also prepared an "analysis ready" data set for students. This will be available on a shared drive and we will introduce the students to the techniques for accessing large datasets without loading them into memory.