Swearson Presentation

Embed Size (px)

Citation preview

  • 8/14/2019 Swearson Presentation

    1/20

    Harmful Algal Bloom forecasting feasibility of

    Monterrey Bay using remote sensing Data and Data

    Extraction Software, Weka.

    SARP 2009

    William Swearson

  • 8/14/2019 Swearson Presentation

    2/20

    Objective

    Attempt to determine thefeasibility of forecasting

    algal blooms in

    Monterey Bay. Successful forecasting

    models have been developed

    for the East Coast, Gulf of

    Mexico as well as in the

    coastal waters near HongKong.

    Monterey Bay presents

    certain difficulties that make it

    unique to forecast.mas.arc.nasa.gov/gallery.html

  • 8/14/2019 Swearson Presentation

    3/20

    What are Algal Blooms? Algae are unicellular,

    sometimes multicellular,autotrophic organisms.

    Blooms are a rapidincrease in algae in a

    marine environment. Under the right

    conditions; favorabletemperature, solar

    radiation, nutrientconcentration, windspeed, and tidal flushingalgae blooms can

    flourish (Lee, Huang, Dickman,Ja awardena, 2002 .

  • 8/14/2019 Swearson Presentation

    4/20

    So why study Algae Blooms?

    Forecasting algae

    blooms=world peace

  • 8/14/2019 Swearson Presentation

    5/20

    So why study Algae Blooms?

    Massachusetts PSP closures as of

    June 25, 2009

    http://www.whoi.edu/page.do?pid=23996&tid=441&cid=69708&ct=61&article=13371

    Algal blooms have

    increased their global

    distribution, their

    frequency, duration, and

    severity of their effects. Some of these algal

    species are called

    HABs, Harmful Algal

    Blooms. Produce a toxin called

    domoic acid.

  • 8/14/2019 Swearson Presentation

    6/20

    Methodology

    Forecasting is beingattempted in MontereyBay using direct andindirect measurements.

    I wanted to look at thepossibilities offorecasting by looking atMASTER data.

    This type projectrequired a large volumeof data at different lagtimes.

    Environmental Sample

    Processor, or "ESP."

  • 8/14/2019 Swearson Presentation

    7/20

    Methodology However we ran into

    unexpected problemsduring MASTER datacollection

    Much was still learned

    despite our setbacks.Eagles may soar, but

    weasels dont getsucked into jet enginesSteven Wright

    During the rest of theflight Clint and Idiligently thought aboutwhat else could be

    done.

  • 8/14/2019 Swearson Presentation

    8/20

    Methodology Dr. John Ryan and Dr. Nick

    Clinton were able to provideme with data from the boatthat took directmeasurements of MontereyBay.

    Excel files contained data onchlorophyll reflectance, seasurface temps, salinity andvarious bands.

    These would be pumpedinto data extractingsoftware, Weka, to look forcorrelations between thedata sets.

  • 8/14/2019 Swearson Presentation

    9/20

    Setup

    Y=G(x) Y, chlorophyll

    Divided into classes, i.e. {high, med,low}

    X, MASTER Bands and Ship Data Subjectively chosen

    Bands 2,12,15,18,22,45,48,49, Salinity, Temp

    G, some unknown function Neural Network

    Multi-Layer Perceptron

    Classification Tree J48

    SimpleCart

  • 8/14/2019 Swearson Presentation

    10/20

    Weka is a collectionof machine learningalgorithms for datamining tasks.

    Uses varioustechniques Classification Trees

    Neural Networking The algorithms can

    be applied directly toa dataset.

    Weka the program is named after this

    curious New Zealand bird.

    Methodology

  • 8/14/2019 Swearson Presentation

    11/20

    What is a Neural Network? Artificial Neural

    Networks, ANNs,use the sameconcept the brainuses to make

    decisions. y=G(x)

    A non-linear datamining techniqueused to discovernon-linearrelationships withchlorophyll.

    Hidden Layer

    Output:

    Chlorophyll

    Input:

    MASTER,temp,

    salinity etc

  • 8/14/2019 Swearson Presentation

    12/20

    Looked at data fromthe Friday July, 24thflight

    4 days before aAlgal Bloom inMonterey Bay.

    Provided data intowhich

    environmentsHABs thrived in(truthing).

    Selected variousbands that hadpatterning of linearregression or arelation betweenchlorophyll.

    Scatter Plots of Chlorophyll vs.

    MASTER BANDS

  • 8/14/2019 Swearson Presentation

    13/20

    Neural Networking

    -Used the subjectively

    chosen plots and all bands.-Subjectively Chosen bands

    had a higher accuracy

    compared to all bands.

  • 8/14/2019 Swearson Presentation

    14/20

    How it works

    SimpleCart

    - Simple Classification And

    Regression Tree

    Function Selection

    Modifier- pruningAccuracy

  • 8/14/2019 Swearson Presentation

    15/20

    Weka Classification

    Tree Model

  • 8/14/2019 Swearson Presentation

    16/20

    temp < 15.244999885

    | temp < 14.676500319999999

    | | temp < 13.278600220000001| | | band8 < 8.25: low(186.0/36.0)

    | | | band8 >= 8.25: high(20.0/19.0)

    | | temp >= 13.278600220000001

    | | | band44 < 7.434999942499999

    | | | | temp < 14.278900145

    | | | | | temp < 13.9678998| | | | | | band42 < 6.2449998855: med(86.0/52.0)

    | | | | | | band42 >= 6.2449998855

    | | | | | | | band43 < 7.0749998094999995

    | | | | | | | | band49 < 7.044999838: med(27.0/10.0)

    | | | | | | | | band49 >= 7.044999838: high(68.0/55.0)

    | | | | | | | band43 >= 7.0749998094999995: high(57.0/23.0)

    | | | | | temp >= 13.9678998: med(104.0/62.0)

    | | | | temp >= 14.278900145: med(73.0/9.0)

    | | | band44 >= 7.434999942499999

    | | | | band4 < 19.150000570000003

    CART ~77% accuracy on the input data set

    Temp

    Near IR

    Thermal

  • 8/14/2019 Swearson Presentation

    17/20

    J48~ 91% accuracy on the input data sety

  • 8/14/2019 Swearson Presentation

    18/20

    Conclusion

    Developing the learning algorithms neededfor making the forecasting models is

    energetically demanding.

    The timing of the datasets is important for

    developing a predictive model.

    The model can show which variables are

    predictive (e.g. MASTER thermal and NIR

    bands). Prediction is feasible, but more data and

    testing are needed.

  • 8/14/2019 Swearson Presentation

    19/20

    Acknowledgments

    John Ryan and Nick Clinton for supplying mewith data.

    Nick Cliton again for helping me with the data. NSERC

    SARP

    Sesame Street for the a letter W and thenumber 6.

  • 8/14/2019 Swearson Presentation

    20/20

    References

    Gower, King, Borstad and Brown. 2009.Detection of intense plankton bloomsusing 709nm band of the MERIS imaging spectrometer. International Journal of

    Remote Sensing. Vol. 26, No. 9.

    Lee, Huang, Dickman, Jayawardena.Neural network modeling of coastal algal

    blooms. 2002. Ecological Modelling 159, 179-201.

    NCCOS. HAB Ecological Forecasting. Center for Sponsored Coastal OceanResearch. 2009. Center for Sponsored Coastal Ocean Research. Retrieved July

    2009.

    http://www.cop.noaa.gov/stressors/extremeevents/hab/current/ecoforecasting.htm

    l

    Oceanus. Researchers Successfully Forecast 2008 Red Tide. 2009. Woods

    Hole Oceanic Institute. Retrieved July 2009.

    http://www.whoi.edu/oceanus/viewArticle.do?id=47406