Loading…
This event has ended. Visit the official site or create your own event on Sched.
Welcome to the Summer Meeting for the Earth Science Information Partners (ESIP)! The 2017 theme is Strengthening Ties Between Observations and User Communities. The theme is based on one of the goals in the 2015 - 2020 ESIP Strategic Plan, which provides a framework for ESIP’s activities over the next three years.
  • Check out the full 2017 Summer Meeting Guide here -> http://bit.ly/ESIP_Sum_Guide_2017
  • Find a map if the Indiana Memorial Union HERE.
  • There will be lots going on in Slack during the meeting, find your invite HERE. #summer_mtg

Miss the plenary? Click here to check it out!
View analytic

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Monday, July 24
 

8:00am

Annual gathering of the LTER Information Managers
The LTER Network of 28 research sites maintains its cohesion in information management practice and technology through the ongoing communication and collaboration among all the sites' information managers via monthly online sessions and an annual in person gathering. Part seminar, part town hall, part hack-a-thon and even part group therapy, these annual meetings have prompted professional growth and a chance to give back to the community that has fostered our own initiation to this profession. Most of these sessions will be open to interested outsiders, with the single exception of our governance discussion and elections. 

Speakers | Moderators
avatar for Jason Downing

Jason Downing

Information Manager, Bonanza Creek LTER
avatar for Gastil Gastil-Buhl

Gastil Gastil-Buhl

Information Manager, Moorea Coral Reef Long Term Ecological Research
I curate datasets for an LTER site for reuse in future and current use by other research groups. I am interested in optimizing data usability and making the curation process more efficient. My favorite part of this work is the collegial spirit among LTER site information managers... Read More →


Monday July 24, 2017 8:00am - 5:00pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

9:00am

sUAS Data Management Workshop
Do you like robots? Do you like flying robots? Do you like open earth science data? Do you like flying robots that collect earth science data to help solve important global challenges?
 
Then please join us in Bloomington, IN for a 1 day drone data Workshop & Hackathon on July 24, 2017 co-located with the Earth Science Information Partners (ESIP) summer meeting. The morning workshop includes short presentations by earth and information scientists from NASA, USGS, DJI, USDA, OGC and others involved in the area of science drones (AKA small unmanned aerial systems AKA sUAS AKA flying data robots). After lunch, speakers and participants will come together for a hackathon, working on projects ranging from prototyping data formating tools, through brainstorming best practices for data management, workflow design, or [your idea here].
 
Attendance is FREE (though please sign up by June 30th, 2017 at the link below) and food will be provided!
 
We are particularly hoping that researchers - including graduate students - with interest or expertise in drone use and/or data management will participate!
 
Please see the links below for a flyer, a tentative agenda, and a signup list. Contact workshop organizers Jane Wyngaard and Lindsay Barbieri for more information at [jwyngaar@nd.edu, lkbar@uvm.edu]
 
When: 24 July 2017, 9am-7pm (Tentative Agenda)
Where: https://imu.indiana.edu/event-planning/spaces/dogwood-room.html
Flyer
Signup list

Speakers | Moderators
LB

Lindsay Barbieri

University of Vermont


Monday July 24, 2017 9:00am - 5:00pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

1:00pm

HDF Workshop
The HDF Group is working with a number of partners to develop next-generation approaches to serving scientific data from the cloud. This session will provide hands-on experience with:


  1. The restful HDF5 Interface

  2. The S3 enabled Python library for HDF5

  3. The Highly Scalable Data Service for HDF data

Please let us know what tools you use with HDF: https://goo.gl/2fMMkU

Speakers | Moderators

Monday July 24, 2017 1:00pm - 5:00pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

5:00pm

7:00pm

Bowling Night (Indiana Memorial Union)
Anyone in town for the ESIP Summer Meeting or DataONE is welcome to join us for an evening of bowling. Lanes and shoes are on us... just bring your A-game!

 
Tuesday, July 25
 

8:00am

Registration Open
Tuesday July 25, 2017 8:00am - 9:00am
East Lounge

9:00am

Welcome
Welcome and overview of the week. 

Speakers | Moderators
EL

Emily Law

Data System and Technology Deputy Program Manager, NASA / Caltech / JPL
I am serving as the President of ESIP. I look forward to talking to all participants, getting to know them better, and helping where I can.
avatar for Erin Robinson

Erin Robinson

Executive Director, ESIP


9:30am

Create EML Using R and Share the Script via the Information Management Code Repository
In this session, we will demonstrate how to use the R Package for EML generation (https://cran.r-project.org/web/packages/EML/index.html). Data managers at the Environmental Data Initiative have been using this package extensively and we have a number example R scripts available in our developing git repository (https://github.com/EDIorg/EMLtools). Participants will be shown how to use github from within R Studio and can eventually share their R scripts via the Information Management Code library. Everyone is encouraged to bring laptops and follow along. R Studio needs to be installed and a basic level of R experience is expected.

The IM Code Repository is the subject of the session “Developing the Information Management Code Repository” at this meeting.

Speakers | Moderators

Tuesday July 25, 2017 9:30am - 11:00am
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

9:30am

Practical Exploration of Strategies for Improving Science Communication
In situations as diverse as giving a presentation to colleagues, communicating the value of your work to potential funders, or simply telling someone you've just met about what you do, a range of strategies could help improve the effectiveness of your communication.

In this session, we'll introduce several communication strategies designed to enhance audience understanding across a range of purposes, and then try them out.

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Denise Hills

Denise Hills

Director, Energy Investigations, Geological Survey of Alabama
Long tail data, data preservation, connecting physical samples to digital information, geoscience policy, science communication


Tuesday July 25, 2017 9:30am - 11:00am
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

9:30am

Insight into Open Source Semantic Technlogies at The Apache Software Foundation
This workshop will introduce people to two key open source semantic technologies (Apache CommonsRDF and Apache Any23) developed and maintained at the Apache Software Foundation; a software foundation fostering software development upon which billions of users depend on free, community-driven software.

This workshop will cover both
• Apache Commons RDF; Commons RDF aims to provide a common library for RDF 1.1 that could be implemented by systems on the Java Virtual Machine. https://commons.apache.org/proper/commons-rdf/, and
• Apache Any23; a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents. https://any23.apache.org/

This workshop will furnish attendees with everything required to generate Earth Science data in RDF and then use that data in a wide variety of applications and scenarios.
The session will begin by introducing how to generate RDF data with CommonsRDF including Introduction, API’s, implementations and then a worked example/code examples for generating Earth Sciences RDF data.
The second half of the session will show how RDF data can be embedded in Webpages, e.g. Dataset Landing Pages, and then interpreted using the Any23
Library. This will cover an Any23 Introduction, API’s and then a worked example/code examples for working with RDF data.

All of the examples and code will be available on the ESIP Github area.

Speakers | Moderators
avatar for Beth Huffer

Beth Huffer

Lingua Logica
avatar for Lewis John McGibbney

Lewis John McGibbney

DAAC Engineer, PO.DAAC
My name is Lewis John McGibbney, I am currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California where I work in Computer Science and Data Intensive Applications. I enjoy floating up and down the tide of technologies @TheASF having a real enthusiasm... Read More →



Tuesday July 25, 2017 9:30am - 11:00am
3 - Georgian 900 E 7th St, Bloomington, IN 47405

9:30am

HDF Analytics
HDF data from many scientific disciplines can be integrated into analytical tools for analysis and visualization on desktops and over the network. Analytical results can be saved to HDF for archive or distribution.

Please let us know what tools you use with HDF: https://goo.gl/2fMMkU

Speakers | Moderators

Tuesday July 25, 2017 9:30am - 11:00am
4 - Maple 900 E 7th St, Bloomington, IN 47405

9:30am

Usability Cluster “Work-a-thon”
The Usability Cluster was re-initiated at the ESIP Summer Meeting 2016, and during the first year of the Cluster’s reactivation, the Cluster has been working in a number of different areas, including learning about various usability evaluation techniques, discussing usability issues at data repositories, and assessing usability of several data services/tools as use cases. Ultimately, building on the lessons learned from these activities, the Usability Cluster is working on developing and introducing a Usability Test Framework that could be used when designing and applying usability evaluations for Earth Science-centric data repositories/services.

During this session, the Usability Cluster members and anyone else who is interested will have a hands-on opportunity to review the current progress, create content, and recommend feedback/future opportunities for the Usability Test Framework. 

The session agenda is included below:

  • Report Out from the RMap team regarding the team's experience with the Framework (~5 minutes)
    • They won’t have performed any usability tests at this point, but can talk about the process of developing the test using the framework.
  • Introduction of the Usability Test Framework (5 - 10 minutes)
  • Creation of a new test script for the Data Management Training Clearinghouse as a demonstration of the Framework (~45 minutes)
    • Introduction to the Data Management Training Clearinghouse
    • Explanation of goals for performing usability testing using the Data Management Training Clearinghouse
  • Critique of the Current Content and Discussion of the next steps for the Framework (15 minutes)
  • Completion the "Post-Test Reflection" section (can be reserved for the next monthly call as needed)

Speakers | Moderators
SH

Sophie Hou

Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
avatar for Madison Langseth

Madison Langseth

U.S. Geological Survey


Tuesday July 25, 2017 9:30am - 11:00am
5 - Oak 900 E 7th St, Bloomington, IN 47405

9:30am

Introduction to Big Data Analytics using NEXUS
NEXUS is an emerging open source technology that enables horizontal processing of geospatial, array-based data for cluster and cloud environments. This workshop provides an introduction to NEXUS and hands-on exercises using Jupyter notebook to interact with NEXUS.

Speakers | Moderators

Tuesday July 25, 2017 9:30am - 11:00am
6 - Walnut 900 E 7th St, Bloomington, IN 47405

11:00am

Networking Break
Tuesday July 25, 2017 11:00am - 11:30am
Conference Lounge

11:30am

Everything you wanted to know about climate but were afraid to ask
Joint recently trained Climate Reality Leader, Ruth Duerr for a short workshop on the topic of the climate. In this workshop we will look at the latest data and news on the science, economics and politics of climate change. Suggestions for activities for students and adults will be described. Questions not answered in real-time will be posted after the event.

Speakers | Moderators
avatar for Ruth Duerr

Ruth Duerr

data; identifiers; citations; discovery and access


Tuesday July 25, 2017 11:30am - 1:00pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

11:30am

Hands on session: using PID Testbed to evaluate RDA PID services
From the Research Data Alliance several services have emerged recently as components of a persistent ID (PID) enabled architecture. Recently funded testbed in the US assembles these component pieces into one place, and gives researchers the opportunity through a guided hands-on session to evaluate the services for adoption into their own projects.

Speakers | Moderators
avatar for Beth Plale

Beth Plale

Co-PI HathiTrust Research Center, Indiana University
Science Director, Pervasive Technology Institute | Director, Data To Insight Center | Professor, Informatics and Computing | Indiana University


Tuesday July 25, 2017 11:30am - 1:00pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

11:30am

Talking to THREDDS in Python with Siphon
Attendees will learn how to use the Siphon Python library to communicate with a THREDDS Data Server. This includes parsing catalogs to identify access methods, as well as using Siphon's client for the netCDF Subset Service (NCSS) to download subsets of data in netCDF format.

Speakers | Moderators

Tuesday July 25, 2017 11:30am - 1:00pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

11:30am

Geographic Information Systems and HDF
Geographic information systems are increasingly important for scientists and decision makers trying to understand complex environmental problems that require integration of many different data types from multiple sources. Integrating HDF data into these tools can be a challenge for many users. This session will include updates from developers of several popular GIS tools: ESRI and Harris as well as a presentation on recently developed tools for accessing HDF data with the GDAL library.

Please let us know what tools you use with HDF: https://goo.gl/2fMMkU

Speakers | Moderators
avatar for Joe Lee

Joe Lee

Software Engineer, The HDF Group
HDF Product Designer | HDF(-EOS) / netCDF / GDAL | OPeNDAP / Hyrax / THREDDS / Pydap | Big data / Spark / Hadoop / Elasticsearch / Logstash / Kibana | Cloud / S3 / Lambda / Docker / Conda | Minecraft / AR / VR / WebGL | Machine Learning / Deep Learning / Keras.io... Read More →


Tuesday July 25, 2017 11:30am - 1:00pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

11:30am

Citations for Software and Services
Citations for data are becoming more common place as data centers are minting DOIs for datasets and journals are requiring that data be cited. The next step is to apply this to software and services. Software and services provide analysis of data and can also produce data. The provenance of this should be captured in a way that easily traceable. Citations and identifiers can help provide this provenance.

This session will provide a summary of current software citation recommendations and formats. We will then go over the elements of each and see what works and what potential gaps there may be by trying to trace some example citations of each style. We will consolidate our results, which will be used to provide recommendations to the ESIP science software cluster and ESDIS software and services working group.

Google Doc https://docs.google.com/document/d/1XszSmEbgas0EKjEIn1EosqtD2W4-j7KtZdSvb72Oq44/edit?usp=sharing

Speakers | Moderators
avatar for Jessica Hausman

Jessica Hausman

Project Support, PODAAC
avatar for Shelley Stall

Shelley Stall

Director, Data Programs, AGU
Ask me about: | AGU's Data Management Assessment Program | FAIR Data | AGU's Data Fair


Tuesday July 25, 2017 11:30am - 1:00pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

11:30am

NEXUS: Big Data Analytics and Cloud Computing
Participants should have completed the 'Introduction to NEXUS' workshop or have general understanding of NEXUS and Cloud Computing technologies. This workshop focuses deploying NEXUS on the Amazon cloud environment using AMI and ingesting data into the NEXUS platform for analysis.

Speakers | Moderators

Tuesday July 25, 2017 11:30am - 1:00pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

1:00pm

Lunch
Tuesday July 25, 2017 1:00pm - 2:00pm
Alumni Hall

1:00pm

SWEET Ontology Interest Group Lunch Meeting
A subcommittee of the Semantic Technology Committee focused on developing and evangelizing the SWEET ontology. We will discuss guidelines for managing and updating the SWEET ontologies, collaboration with other ontology development efforts such as ENVO, and encouraing adoption by the Earth science community. We would like especially to encourage subject matter experts who have expressed an interest in working with us on development and maintaining SWEET to attend.



Speakers | Moderators
avatar for Beth Huffer

Beth Huffer

Lingua Logica
avatar for Lewis John McGibbney

Lewis John McGibbney

DAAC Engineer, PO.DAAC
My name is Lewis John McGibbney, I am currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California where I work in Computer Science and Data Intensive Applications. I enjoy floating up and down the tide of technologies @TheASF having a real enthusiasm... Read More →


Tuesday July 25, 2017 1:00pm - 2:00pm
Alumni Hall

2:00pm

Earthdata Search Client Usability Workshop
In this hands on Workshop session we will discuss the approach and results of a recent NASA Usability Study on the Earthdata Search Client and Reverb data access applications. We'll review the findings, discuss the implications, and provide hands on experience with changes made as a direct result. We'll explore basic data access patterns as well as a new data services interface designed to simplify access to NASA's large Earth Science data holdings.

Speakers | Moderators
avatar for Kathleen Baynes

Kathleen Baynes

Systems Architect, ESDIS
avatar for Dan Pilone

Dan Pilone

Technical Lead, Element 84, Inc.
DS

Dana Shum

Development Manager, EED2 Program


Tuesday July 25, 2017 2:00pm - 3:30pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

2:00pm

Beef producers consider future conditions: Distilling a project report into a Climate Resilience Toolkit case study
This session-within-a-session will cover two topics: Attendees will hear how Extension personnel encourage beef producers to consider and prepare for potential future conditions—all without saying "climate change." Then, session attendees will draw from the presentation to outline a case study for the Climate Resilience Toolkit (CRT). Attendees can use strategies from the session to highlight their own projects' accomplishments.

This is the 4th workshop sponsored by ESIP's Agriculture & Climate Cluster to develop CRT case studies.

Workshop agenda:

- [5 min] Introduction to the workshop, logistics, larger goal to establish a CRT pipeline at the ESIP level

- [10 min] LuAnn Dahlman, NOAA, Introduction to CRT, including a backwards journey from a CRT case study to the story template

- [20 min] Crystal Powers, University of Nebraska-Lincoln and USDA Northern Plains Climate Hub, Climate Scenario Planning for Resilient Beef Systems

Resiliency to weather extremes is, on the one hand, a topic with which farmers and ranchers in the Northern Plains are already familiar, because of the typical wide range in temperature and moisture conditions in the region. But now, climate change is adding new uncertainties that make it difficult to know what are the best practices for the future. In an ongoing project, the UNL Extension team used scenario planning with beef system stakeholders to discover robust adaptive management options, prioritize Extension programming needs, and provide an open forum for discussion.

[45 min] - Group discussion on and drafting of an incipient story that would become a CRT case study

[10 min] - Wrap-up: Next steps; CRT "pipeline" in OSF

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Nancy Hoebelheinrich

Nancy Hoebelheinrich

Principal, Knowledge Motifs LLC
See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
CP

Crystal Powers

Extension Engineer, the University of Nebraska-Lincoln
avatar for Bill Teng

Bill Teng

Principal Scientist, NASA GES DISC (ADNET)


Tuesday July 25, 2017 2:00pm - 3:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

2:00pm

Be interesting and be heard: Communicating your science to policy makers (Part 1)
Communication skills are key to a successful career and important in all aspects of life. Effectively communicating science to different audiences can be a challenge. Most researchers know how to talk to other researchers (at least within their field), but policy makers have different needs when they are hearing about research. Researchers need to adapt their discussion to the audience - policy makers may have limited science background, but respect researchers’ expertise. Determining the “ask” ahead of time is key to a successful visit. Many scientists provide their supporting evidence to build to a conclusion, but most policy makers want to know the important message first, and then if they need more information get the supporting evidence.

Participants will leave knowing how to create clear and concise messages that are targeted to their audience, and will gain an understanding of how to approach policy makers on scientific issues.

Note: This is a two-part session. For the most effective workshop, participation will be limited to 20 people. Registration is available here; once we reach 20 registrants we will start a wait-list.

Speakers | Moderators
avatar for Denise Hills

Denise Hills

Director, Energy Investigations, Geological Survey of Alabama
Long tail data, data preservation, connecting physical samples to digital information, geoscience policy, science communication


Tuesday July 25, 2017 2:00pm - 3:30pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

2:00pm

Sustainable HDF Archives
The EOS Archive includes many products that were designed several decades ago and have successfully served science needs ever since. How can we insure that these products continue to serve these needs in the future? This session will include presentations by The HDF Group on data migration, product design, and standard metadata for HDF products.

Please let us know what tools you use with HDF: https://goo.gl/2fMMkU

Speakers | Moderators
avatar for Joe Lee

Joe Lee

Software Engineer, The HDF Group
HDF Product Designer | HDF(-EOS) / netCDF / GDAL | OPeNDAP / Hyrax / THREDDS / Pydap | Big data / Spark / Hadoop / Elasticsearch / Logstash / Kibana | Cloud / S3 / Lambda / Docker / Conda | Minecraft / AR / VR / WebGL | Machine Learning / Deep Learning / Keras.io... Read More →
avatar for Elena Pourmal

Elena Pourmal

Development Manager, EED2 Program


Tuesday July 25, 2017 2:00pm - 3:30pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

2:00pm

Use Application Programming Interfaces of Data Repositories to Create Local Data Catalogs
The Environmental Data Initiative (EDI) data repository is a platform that allows ecological  researchers to archive data. However, while the repository provides search, download, and other data cataloging functions that facilitate data discoverability and access, research groups are often required to maintain a local catalog featuring those same data but on a project-specific website. Meeting this need is traditionally addressed by running two parallel systems: (1) the data submitted to the EDI repository, and (2) maintaining a local copy of the data catalog. This approach is inefficient and invites inconsistencies between systems. Although most repositories and DataONE provide APIs to access data in this breakout session, we will discuss and demonstrate how data within the EDI repository may be accessed using the PASTA+ API. The API may be used to harvest data associated with a particular research group, project, or station, which can then be branded and styled for display on a project website. Using this approach, a research group can generate a local catalog of project data by capitalizing on EDI data repository functionality, and avoid the overhead of maintaining two separate data catalogs.

Speakers | Moderators
avatar for Duane Costa

Duane Costa

Programmer Analyst 3, University of New Mexico
I am a developer with the Environmental Data Initiative.
avatar for Gastil Gastil-Buhl

Gastil Gastil-Buhl

Information Manager, Moorea Coral Reef Long Term Ecological Research
I curate datasets for an LTER site for reuse in future and current use by other research groups. I am interested in optimizing data usability and making the curation process more efficient. My favorite part of this work is the collegial spirit among LTER site information managers... Read More →


Tuesday July 25, 2017 2:00pm - 3:30pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

2:00pm

Tools and Strategies for End-to-End Sensor Data Management
The low cost and ubiquity of electronic sensors and increasingly practical options for setting up sensor networks have revolutionized environmental science. Emerging standards and web services for accessing sensor data and tasking sensors via the internet are further accelerating this trend. Investigators can now acquire vast amounts of sensor data as they conduct their research, but efficiently managing, documenting and quality-controlling those data and submitting them to a suitable repository is still problematic for many. This breakout session will focus on practical software tools and cloud-based services for establishing an end-to-end sensor data management strategy. It will include four or five presentations on new cloud-based services for acquiring sensor data (e.g. CHORDS, OpenSensorHub), code libraries for documenting and quality-controlling data using common software frameworks (e.g. MATLAB, R), and options for automating submission of sensor data to repositories (e.g. EDI, other DataONE nodes). This session will also be coordinated with a panel discussion on sensor data standards and tool development needs for fully enabling end-to-end sensor data management.


Tuesday July 25, 2017 2:00pm - 3:30pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

3:30pm

Networking Break
Tuesday July 25, 2017 3:30pm - 4:00pm
Conference Lounge

4:00pm

Developing an Information Management Code Repository
We are proposing to develop an Information Management Code Repository. The code in the repository will focus on data formatting, quality control, automating ingestion, metadata generation, and implementation of best practices for data archiving. General data manipulation routines may be of interest to the science community as well as information managers.

The code repository will be implemented in Github, and code will be freely available for anyone to use. For this code repository to be successful, the code must be well documented, discoverable, and ‘hardened or generalized’ such that it can be readily re-used. In this working session, we propose to define best practices for the code repository. Some of the issues to be addressed during this working group meeting are:

1) development of a controlled vocabulary to organize the code and aid in code discovery
2) code organization within Github
3) repository governance: who will be evaluating the code to determine if it is ready to be archived, and what are the criteria (e.g.,
https://esipfed.github.io/Software-Assessment-Guidelines/index.html
https://www.software.ac.uk/sites/default/files/SSI-SoftwareEvaluationCriteria.pdf http://testbed.esipfed.org/sites/default/files/2015_AIST_Evaluations_Overview.pdf )
4) metadata format for code (e.g., https://github.com/NCEAS/eml , https://github.com/codemeta/codemeta/blob/master/README.md)

This code library will be supported by the Environmental Data Initiative (EDI).


Speakers | Moderators
avatar for Colin Smith

Colin Smith

Information Manager, Environmental Data Initiative


Tuesday July 25, 2017 4:00pm - 5:30pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

4:00pm

Observing Strategies using Satellites in Sensor Webs
Now that smallsats and cubesats have demonstrated the ability to produce science-quality data, new types of observing strategies are possible. Commercial/scientific viability is being demonstrated with PlanetLabs, CYGNSS, TROPIC and others. Specifically, constellations of small, inexpensive satellites can be effectively used to make observations that were impossible with single, large satellites. What new types of phenomena would be observable (think transient and transitional)? What new scientific questions can be answered? How would this data be different than that from a traditional battlestar gallactica? What would it take to permit the integration of such data into a more cohesive picture of the environment? The output of this session would be a concept paper, reflecting the potential opportunities and issues in information output from this type of platform.


Tuesday July 25, 2017 4:00pm - 5:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

4:00pm

Be interesting and be heard: Communicating your science to policy makers (Part 2)
Communication skills are key to a successful career and important in all aspects of life. Effectively communicating science to different audiences can be a challenge. Most researchers know how to talk to other researchers (at least within their field), but policy makers have different needs when they are hearing about research. Researchers need to adapt their discussion to the audience - policy makers may have limited science background, but respect researchers’ expertise. Determining the “ask” ahead of time is key to a successful visit. Many scientists provide their supporting evidence to build to a conclusion, but most policy makers want to know the important message first, and then if they need more information get the supporting evidence.

Participants will leave knowing how to create clear and concise messages that are targeted to their audience, and will gain an understanding of how to approach policy makers on scientific issues.

Note: This is a two-part session. For the most effective workshop, participation will be limited to 20 people. Registration is available here; once we reach 20 registrants we will start a wait-list.

Speakers | Moderators
avatar for Denise Hills

Denise Hills

Director, Energy Investigations, Geological Survey of Alabama
Long tail data, data preservation, connecting physical samples to digital information, geoscience policy, science communication


Tuesday July 25, 2017 4:00pm - 5:30pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

4:00pm

HDF Capabilities and Tools
The Hierarchical Data Format is used in many Earth Science communities and many other domains. This session will include presentations about a variety of HDF capabilities and tools.


Please let us know what tools you use with HDF: https://goo.gl/2fMMkU

Speakers | Moderators
avatar for Joe Lee

Joe Lee

Software Engineer, The HDF Group
HDF Product Designer | HDF(-EOS) / netCDF / GDAL | OPeNDAP / Hyrax / THREDDS / Pydap | Big data / Spark / Hadoop / Elasticsearch / Logstash / Kibana | Cloud / S3 / Lambda / Docker / Conda | Minecraft / AR / VR / WebGL | Machine Learning / Deep Learning / Keras.io... Read More →
avatar for Elena Pourmal

Elena Pourmal

Development Manager, EED2 Program


Tuesday July 25, 2017 4:00pm - 5:30pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

4:00pm

Collecting and analyzing web service metrics, a Working Session of the Web Services cluster
This working session will enable Web Services Cluster members, and any who would like to join with us, to determine how best to collect and analyze metrics for web services. We are interested in all sorts of web services, but are initially focusing on OPeNDAP, WCS, WMS and, if we can include it too, ESIP OpenSearch. Topics to be discussed:
* How to we get information from servers;
* What role does ESIP play;
* How to optimize log files for diagnostics; and
* How do we intend to process these data.


Speakers | Moderators

Tuesday July 25, 2017 4:00pm - 5:30pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

4:00pm

Using D3.js
There's no escaping the fact that data visualization is hot right now. Everyone wants to tell their data's story visually, whether it be through a map, chart, or more detailed presentation. The difficulty is there are so many different tools that solve this, each one with their own benefits and limitations.

D3.js is an awesome tool for handling this task -- which is the approach we've used for the sites like the Nation's Report Card, BlueCross BlueShield of North Carolina, GlobalChange, and others.

We'll cover:
1) What D3 is and why you should be using it
2) Getting data out of Drupal and into D3
* Views
* Drupal 8 Rest API
3) D3 visualization techniques
* Charts & Graphs
* Maps
* Specialized Data Visualizations

Speakers | Moderators
avatar for Adam Shepherd

Adam Shepherd

Software Developer, WHOI/BCO-DMO
Linked Data | Semantic Web | Drupal | Metadata | Marine Data


Tuesday July 25, 2017 4:00pm - 5:30pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405
 
Wednesday, July 26
 

8:00am

Registration Open
Wednesday July 26, 2017 8:00am - 9:00am
East Lounge

9:00am

Plenary Welcome | 2017 Total Solar Eclipse Overview
Speakers | Moderators
EL

Emily Law

Data System and Technology Deputy Program Manager, NASA / Caltech / JPL
I am serving as the President of ESIP. I look forward to talking to all participants, getting to know them better, and helping where I can.
avatar for Erin Robinson

Erin Robinson

Executive Director, ESIP


Wednesday July 26, 2017 9:00am - 9:20am
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

9:20am

Raskin Scholar

Bridging the Gap between NASA Earth Observations and Decision Makers

The NASA DEVELOP National Program bridges the gap between NASA Earth Science and communities worldwide by building capacity in both participants and partner organizations who collaborate on projects. These rapid feasibility projects highlight the capabilities of satellite and aerial Earth observations to address challenges in agriculture, weather, climate, health and air quality, disasters, energy, ecological forecasting, oceans, and water resources. Immersing decision and policy makers in these feasibility projects increases awareness of the capabilities of Earth observations and contributes to the tools and resources available to support enhanced decision making. This presentation will highlight two DEVELOP projects from the University of Georgia node that illustrate how DEVELOP builds capacity within user communities by increasing their understanding and use of NASA Earth observations.

The Atlanta Water Resources project introduced Landsat 8 OLI and Terra ASTER data to The Nature Conservancy to provide a spatial analysis for their proposed reforestation efforts aimed at reducing harmful stormwater runoff and modeling multiple land allocation scenarios. The results informed green infrastructure development planning for a sustainable and resilient urban environment in Atlanta. The Georgia Energy project incorporated Landsat 8 OLI and Terra CERES data to analyze land cover trends, cloud cover, solar radiation, and solar insolation. This study used Earth observations to create a solar site suitability and conflict identification model for the Georgia Department of Natural Resources, The Nature Conservancy, Georgia Power Company, and other solar farm developers. These projects improved dialog between Earth science practitioners and end-users to produce decision-support tools focused on green infrastructure and renewable energy development.


Speakers | Moderators
avatar for Caren Remillard

Caren Remillard

Caren Remillard was born in Atlanta, Georgia and received a Bachelor’s degree in Anthropology from Emory University in 2011 and a Master’s degree in Geography from the University of Georgia in 2014. As an undergraduate, she studied abroad at the University of Sydney in Austra... Read More →


Wednesday July 26, 2017 9:20am - 9:40am
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

9:40am

Plenary
Speakers | Moderators
RL

Rebecca Lave

Associate Professor and Director of Undergraduate Studies | Department of Geography, Indiana University
BM

Brandon Montellato

Brandon Montellato is the University Relations Manager at DJI, serving as the lead liaison for university research. He also supports DJI STEAM academic partnerships to develop certification programs, curriculum, and education applications. Prior to joining DJI, Brandon led strate... Read More →
MZ

Matt Zumwalt

Program Manager, Protocol Labs website: https://flyingzumwalt.com | github: @flyingzumwalt | twitter: @flyingzumwalt | linkedin: https://www.linkedin.com/in/flyingzumwalt/ Matt is an advocate for re-decentralizing the web. He’s driven by a conviction that, rather than relying on intermediaries when we exchange information, the world needs technical, social and economic infrastructure that empowers communities to create, store and exchange data directly with each other. This conviction is reflected in Matt's 2016... Read More →


Wednesday July 26, 2017 9:40am - 11:00am
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

11:00am

Networking Break
Wednesday July 26, 2017 11:00am - 11:30am
Conference Lounge

11:30am

Plenary
Speakers | Moderators
avatar for Beth Plale

Beth Plale

Co-PI HathiTrust Research Center, Indiana University
Science Director, Pervasive Technology Institute | Director, Data To Insight Center | Professor, Informatics and Computing | Indiana University
AR

Angie Raymond

Assistant Professor | Department Business Law and Ethics | Kelley School of Business
JM

Jennifer M. Schopf

Director, International Networks | Indiana University


Wednesday July 26, 2017 11:30am - 12:15pm
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

12:15pm

Scientific Quality - Panel
The Information Quality Cluster has formally defined information quality as a combination of the following four aspects of quality, spanning the full life cycle of data products:
1. Scientific quality defined in terms of accuracy, precision, uncertainty, validity and suitability for use (fitness for purpose); 2. Product quality that takes the following into account: the degree to which the scientific quality is assessed and documented; how accurate, complete and up-to-date the metadata and documentation are; the manner in which the data and metadata are formatted; the degree to which the associated information is published and traceable throughout the data lifecycle; 3. Stewardship quality addressing questions such as how well data are being managed, preserved, and made accessible; and 4. Service quality that deals with how easy it is for users to discover, get, understand, trust, and use a given data product along with its metadata, as well as ensuring that an archive has the requisite knowledge base and people functioning as subject matter experts available to help its data users.

The purpose of this session is to focus on scientific quality, and especially on uncertainty. A panel of invited speakers from a variety of Earth science disciplines will address questions such as: How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty?  What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to the (possibly skeptical) public?
Ultimately, this session intends to provide expert knowledge from different perspectives on a relatively focused topic–Data Uncertainty–that is extremely challenging but critical in both establishing and elevating the user communities’ confidence in Earth Science data.  ​Hopefully, ​these presentations will lead to a community-wide discussion during and after the meeting on the problems, challenges, and solutions to a wide range of issues surrounding data uncertainty.
Discussion initiated during this plenary session inspired by the invited panelists’ presentations will be continued in the afternoon breakout session, http://sched.co/As6H, for a more collaborative interchange of technical information intended to help advance the Scientific Quality of Earth science data and to discuss effective ways to capture and communicate uncertainty to a broader community.

AGENDA

A. Introduction
- David Moroni

B. Panelists' Presentations:
  1. Information about the Data is as Important as the Data itself - Carol Anne Clayson (Woods Hole Oceanographic Institution)
  2. Data uncertainty: what is it, where does it come from, and why should we care? - Amy Braverman (Jet Propulsion Laboratory, California Institute of Technology)
  3. Challenges in Evaluating Global Climate Models with the Limited Observational Data Record - Isla Simpson (National Center for Atmospheric Research)
C. Q&A - All

SESSION NOTES: https://docs.google.com/document/d/1jdspBWgXt2B5vXeAw0R2qv7IQsCau9EeMUUsgGTU3Z0/edit

PRESENTATION ABSTRACTS & SPEAKER BIOGRAPHIES:
1.
Information about the Data is as Important as the Data itself  - Carol Anne Clayson (Woods Hole Oceanographic Institution)
Abstract: In this talk I will discuss the importance of creating accurate data information, including uncertainty estimates and data flags, for all steps of the data production and data usage pipeline.   I will use real world examples from the creation of a Climate Data Record, the use of satellite and in situ data for weather and climate problems, and from the development and planning of large programs such as the NASA Decadal Earth Science Survey. Through these examples I will highlight some of the difficulties in producing the metadata for a data set, as well as some practical uses of this data from the end user standpoint.

Biography: Carol Anne Clayson is a Senior Scientist in the Department of Physical Oceanography at the Woods Hole Oceanographic Institution (WHOI).  Dr. Clayson is the author or co-author of over 45 journal articles, two books on air-sea boundary layers and numerical ocean modeling, as well as holding one patent. She has served on numerous national and international science panels, including her current position as the co-chair of the Climate Panel for the Decadal Survey for Earth Science and Applications from Space.  Her current areas of research include understanding how air-sea interactions affect the climate scale, how the global water cycle is varying, and how the ocean responds to processes such as tropical cyclones.

2. Data uncertainty: what is it, where does it come from, and why should we care? - Amy Braverman (Jet Propulsion Laboratory, California Institute of Technology)
Abstract: NASA, NOAA, and other space agencies are producing massive quantities of data that will ultimately be used for science and decision making. These data are typically collected by observing systems that capture indirect measurements of the phenomena of interest (i.e. radiances), and then apply complex algorithms to infer the underlying geophysical phenomena. But, the physical observing mechanisms and algorithms are imperfect, and this creates uncertainty about the results. Probability provides a formal mechanism for quantifying uncertainty, in the form of probability distributions, that is coherent, intuitive, and mathematically precise. In this talk I will discuss why I believe that probabilities should be used to define data uncertainty in the remote sensing context, and the consequences of not doing so.

Biography:
Amy Braverman is Principal Statistician at the Jet Propulsion Laboratory. She holds a B.A. in Economics from  Swarthmore College, and after working for eight years in litigation support consulting, she entered UCLA where she earned an M.A. in Mathematics and Ph.D. in Statistics. She joined JPL in 1999 as a post doc, and has been there ever since. Dr. Braverman's research interests span several areas, all related to the use of massive remote sensing data sets in Earth and climate science: information-theoretic approaches to data reduction, data fusion using spatial and spatio-temporal statistical methods, development of new statistical methodologies for the evaluation of climate models by comparison to observations, and uncertainty quantification for remote sensing applications. She is a Fellow of the American Statistical Association, and serves on the editorial board of the Journal on Uncertainty Quantification. She is a Program Leader of the Statistical and Applied Mathematical Sciences' (SAMSI's) upcoming program on Mathematical and Statistical Methods for Climate and the Earth System (2017-2018), and served in similar roles for previous SAMSI programs on uncertainty quantification and massive data sets. In December 2008 she led the National Research Council's (NRC) Workshop on Uncertainty Management in the Use of Remote Sensing Data for Climate Studies, and was a member of the NRC's Committee on Applied and Theoretical Statistics from 2004-2010. In 2008 she and colleagues founded the American Statistical Association's Advisory Committee on Climate Change Policy.

3. Challenges in Evaluating Global Climate Models with the Limited Observational Data Record - Isla Simpson (National Center for Atmospheric Research)
 Abstract: A continuing challenge when it comes to evaluating global climate models is accounting for the uncertainties associated with the limited length of observation based records.  For many aspects of both the climatology and shorter term climate variability, the limited observational data record results in an uncertain ground truth against which to compare our models.  This motivates combining the use of the observational records that we do have with large model ensembles to assess both the uncertainty associated with short term records and the fidelity of our model simulations.  Here, the case of the Northern Hemisphere extra-tropical circulation response to the El Nino Southern Oscillation (ENSO) will be presented as an example and it will be shown that substantial uncertainties exist on the composite mean response to ENSO over the limited observational record due, in large part, to internal atmospheric variability (weather noise).     

Biography:
Isla Simpson is a Scientist in the Climate Analysis Section of the Climate and Global Dynamics Division, NCAR, studying large scale atmospheric dynamics and its representation in Global Climate Models. She has a Ph.D. degree in Atmospheric Physics from the Imperial College, London. She works to understand dynamical mechanisms involved in the variability and change of the large scale...

Moderators
DM

David Moroni

Data Engineer, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Data Engineer at the Jet Propulsion Laboratory working primarily on the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation, and metadata for their oceanographic datasets... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
avatar for H. K. “Rama” Ramapriyan

H. K. “Rama” Ramapriyan

Research Scientist, Subject Matter Expert, Science Systems and Applications, Inc.
Member, ESIP Data Stewardship Committee; Chair, ESIP Information Quality Cluster; Co-Chair, NASA ESDSWG Data Quality Cluster; NASA ESDIS Project (Assistant Project Manager, Retired).

Speakers | Moderators
DM

David Moroni

Data Engineer, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Data Engineer at the Jet Propulsion Laboratory working primarily on the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation, and metadata for their oceanographic datasets... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
avatar for H. K. “Rama” Ramapriyan

H. K. “Rama” Ramapriyan

Research Scientist, Subject Matter Expert, Science Systems and Applications, Inc.
Member, ESIP Data Stewardship Committee; Chair, ESIP Information Quality Cluster; Co-Chair, NASA ESDSWG Data Quality Cluster; NASA ESDIS Project (Assistant Project Manager, Retired).


Wednesday July 26, 2017 12:15pm - 1:00pm
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

1:00pm

Lunch
Wednesday July 26, 2017 1:00pm - 2:00pm
Alumni Hall

1:00pm

2018 Geosemantics Symposium Lunchtime Planning Session
In this lunchtime-working session we will begin planning for the next Geosemantics Symposium, to be held in conjunction with the 2018 ESIP Winter Meeting. Come with your ideas for topics, speakers and interactive workshops.

Speakers | Moderators
avatar for Beth Huffer

Beth Huffer

Lingua Logica
avatar for Lewis John McGibbney

Lewis John McGibbney

DAAC Engineer, PO.DAAC
My name is Lewis John McGibbney, I am currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California where I work in Computer Science and Data Intensive Applications. I enjoy floating up and down the tide of technologies @TheASF having a real enthusiasm... Read More →


Wednesday July 26, 2017 1:00pm - 2:00pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

2:00pm

Scientific Data Quality - Information Quality Cluster

Session Summary

This session is intended to follow the plenary session (http://sched.co/As6G) on Scientific Quality, which has a special focus on data uncertainty. The purpose of the session is to continue discussion initiated during the plenary session inspired by the invited panelists’ presentations from the plenary session. It will foster a more collaborative interchange of technical information intended to help advance the Scientific Quality of Earth science data and to discuss effective ways to communicate uncertainty to a broader community. This session will feature an additional invited speaker who will join the plenary speakers to facilitate this discussion. The panelists’ diverse perspectives and subject matter expertise will enable exchange of ideas and deeper technical discussion with the active participants in the session. .

As indicated in the plenary session description, the Information Quality Cluster has formally defined information quality as a combination of the following four aspects of quality, spanning the full life cycle of data products: 1. Scientific quality; 2. Product quality; 3. Stewardship quality; and 4. Service quality. The focus of this session is on scientific quality, and especially on uncertainty. In the preceding plenary session, a panel of invited speakers from a variety of Earth science disciplines will have addressed questions such as: How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty? What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to (possibly skeptical) public?

This session provides for more discussion with the panelists and other practitioners in scientific data generation, scientific quality assessment, and management. The discussion is expected to help identify issues pertaining to understanding, capturing and conveying uncertainty and to recommend actions that the Earth science data community can readily act upon to ensure and improve the overall scientific quality of their datasets.

 

Agenda

Introduction - H. K. "Rama" Ramapriyan       10 min.

Panelist Presentation:

  1. Bridging the gap between data and models - Matthew Plumlee  (University of Michigan) 15 min.

Key Points and Issues from Plenary Session – David Moroni – 5 min.

Additional Comments from Plenary Session Speakers:

  1. Information about the Data is as Important as the Data itself - Carol Anne Clayson (Woods Hole Oceanographic Institution)    5 min.
  2. Data uncertainty: what is it, where does it come from, and why should we care? - Amy Braverman (Jet Propulsion Laboratory, California Institute of Technology)   5 min.
  3. Challenges in Evaluating a Global Climate Models with the Limited Observational Data Record - Isla Simpson (National Center for Atmospheric Research)   5 min.

Open Discussion - All

Details:

Session notes at https://docs.google.com/document/d/16crMtTMJ_cqh0hpirS4nwn7hu-HzB6i7nip2YhZtWY4/edit

Bridging the gap between data and models - Matthew Plumlee (University of Michigan)

Abstract:  Commonly, data are used for model verification, validation, and calibration. Because data can be noisy and biased and models can be inexact, it is often done ad-hoc and does not produce the desired results.  This talk gives a brief, high-level overview of the data/model gap and some conceptual approaches to better work at this interface.

Biography: Matthew Plumlee’s studies include a method for statistical experimentation and uncertainty quantification for complex systems.   He received his PhD from the Georgia Institute of Technology and has since worked as an Assistant Professor the University of Michigan.  This fall, he will be joining Northwestern University's Industrial Engineering and Management Science Department.

Abstracts of the talks and biographies of the speakers from the plenary session can be found at http://sched.co/As6G.


Moderators
DM

David Moroni

Data Engineer, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Data Engineer at the Jet Propulsion Laboratory working primarily on the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation, and metadata for their oceanographic datasets... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
avatar for H. K. “Rama” Ramapriyan

H. K. “Rama” Ramapriyan

Research Scientist, Subject Matter Expert, Science Systems and Applications, Inc.
Member, ESIP Data Stewardship Committee; Chair, ESIP Information Quality Cluster; Co-Chair, NASA ESDSWG Data Quality Cluster; NASA ESDIS Project (Assistant Project Manager, Retired).

Speakers | Moderators
DM

David Moroni

Data Engineer, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
I am a Data Engineer at the Jet Propulsion Laboratory working primarily on the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation, and metadata for their oceanographic datasets... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management
avatar for H. K. “Rama” Ramapriyan

H. K. “Rama” Ramapriyan

Research Scientist, Subject Matter Expert, Science Systems and Applications, Inc.
Member, ESIP Data Stewardship Committee; Chair, ESIP Information Quality Cluster; Co-Chair, NASA ESDSWG Data Quality Cluster; NASA ESDIS Project (Assistant Project Manager, Retired).


Wednesday July 26, 2017 2:00pm - 3:30pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

2:00pm

What role should ESIP play in transitioning technologies from research to operations?
Federal agencies and big companies both struggle at times in taking the big ideas coming out of research and development and turning them into operational reality. This session will pose some of these challenges from different sectors with real world use cases, ask participants to brainstorm how the activities going on in ESIP can potentially address these challenges, and work together to come up with a project to test the ideas in followup actions from the ESIP summer meeting.

Speakers | Moderators

Wednesday July 26, 2017 2:00pm - 3:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

2:00pm

“Data rescue” & ESIP
In the past few months, a range of grassroots initiatives, termed "data rescue," "data refuge," and "guerrilla archiving", have gained significant momentum to duplicate US government agency data. These initiatives are inspired by recent reports that scientific data and documentation have been removed from government websites, and by concerns over US budget proposals that slash scientific budgets. Much of this effort has focused on creating copies of Earth science data generated and held by US federal agencies, with hundreds of volunteers across the country and internationally spending considerable time and energy.

Early connections have been made between the “rescue” volunteers and the federally-funded data community; these conversations have highlighted some of the different perspectives and opportunities regarding agency data. Members of the ESIP Data Stewardship committee recently produced a document that provided information and recommendations from the data center perspective, http://www.esipfed.org/press-releases/stronger-together. This panel will present responses to this document from the “data rescue” community, and will open discussion for any further action the ESIP community can/should take on this issue.

Speakers | Moderators
avatar for Ruth Duerr

Ruth Duerr

data; identifiers; citations; discovery and access
avatar for Margaret Janz

Margaret Janz

Data Curation Librarian, University of Pennsylvania
I'm the data curation librarian at UPenn and a member of the Data Refuge team and Libraries+ Network efforts there. Interested in open data, open science, and ensuring continued access to government information.
MM

Matthew Mayernik

Project Scientist / Research Data Services Specialist, National Center for Atmospheric Research (NCAR)
avatar for Nancy Ritchey

Nancy Ritchey

Archive Branch Chief, NOAA/NCEI
avatar for Justin Schell

Justin Schell

University of Michigan Library - Shapiro Design Lab
Justin Schell is the Director of the Shapiro Design Lab, a peer learning and project design community at the University of Michigan Library. Passionate about all things community and citizen science, he has helped organize Data Rescue events across the country. He is also the fou... Read More →


Wednesday July 26, 2017 2:00pm - 3:30pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

2:00pm

Documentation Systems: Solutions for Metadata Curation and Creation

We are living in a world with many metadata dialects and many multi-organization/dialect repositories. We’ll be having a conversation about some of the challenges that arise for the organizations contributing metadata and the organizations that create these centralized systems. These and other questions that will be explored with short presentations and open discussion:

  • EMMA has metadata from many NOAA Line Offices and groups within data centers – they do metrics…
  • CMR has metadata from 18 DAAC collections (ECHO) and many GCMD collections (DIF) – the largest collection metadata collection in CMR is NOAA_NCEI – How does NCEI interact with CMR?
  • All feds have experience having their metadata harvested into data.gov (mangled?)
  • The SciOps collection (formerly GCMD) in CMR has over 1500 providers. How does that work?
  • DataOne has many member nodes, sub-communities, and dialects; do community recommendations influence multiple dialects?
  • NCEI Silver Spring has recently become a DataOne Node – how was that experience from a metadata provider point of view?
  • Organizations harvest metadata from many providers – how do they validate? How do they know the metadata is complete for their community and it's recommended concepts?

Speakers | Moderators
avatar for Sean Gordon

Sean Gordon

Metadata Developer, The HDF Group
I co-chair the Documentation Cluster, I'd love to hear from you what directions you'd like to see the cluster headed in the coming year. | I'm excited to run a Metadata Improvement Lab at ESIP that will teach participants to initiate a toolkit for evaluating metadata in any di... Read More →


Wednesday July 26, 2017 2:00pm - 3:30pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

2:00pm

Drone Cluster Session
Report backs and strategy developments from around the growing sUAS data management domain

Speakers | Moderators
LB

Lindsay Barbieri

University of Vermont


Wednesday July 26, 2017 2:00pm - 3:30pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

2:00pm

Extending use of the Open-data CovJSON-LD Format for Improved Interpretation of Coverage Data at NASA JPL’s PO.DAAC
CoverageJSON (or “CovJSON” for short) is a JSON-based format and specification for encoding coverage data. It is primarily intended to be used as a means to convey data from servers to web browsers, so that web developers can develop rich, highly interactive applications based on coverage data. This presentation will update and extend our previous work (presented at AGU Fall Meeting 2016) entitled "An Innovative Open Data-driven Approach for Improved Interpretation of Coverage Data at NASA JPL’s PO.DAAC" by displaying how we use URIs to encode semantic information and context through linked data mechanisms such as JSON-LD e.g. CovJSON-LD. This presentation will be of significant appeal to both semantic technologies and visualization enthusiasts.

Speakers | Moderators
avatar for Beth Huffer

Beth Huffer

Lingua Logica
avatar for Lewis John McGibbney

Lewis John McGibbney

DAAC Engineer, PO.DAAC
My name is Lewis John McGibbney, I am currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California where I work in Computer Science and Data Intensive Applications. I enjoy floating up and down the tide of technologies @TheASF having a real enthusiasm... Read More →



Wednesday July 26, 2017 2:00pm - 3:30pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

2:00pm

Provisional NOAA GOES-16 Data on the Cloud
This late-breaking session will discuss the just-announced (July 10) availability of provisional data from the new GOES-16 geostationary weather satellite operated by the National Oceanic and Atmospheric Administration (NOAA). As part of the NOAA Big Data Project, several Cloud platforms will be making the GOES-16 data accessible for computation and egress. Some details are still to be finalized, and the data are still considered Provisional rather than Operational pending final check-out of the spacecraft and data quality control. This session will provide whatever information is known at the time, followed by an open discussion of potential ideas, tools, issues, and focused activities around these data. Of particular interest are thoughts regarding how to encourage and facilitate in-place computation, particularly using Cloud-native services, rather than mere downloading of data.

Note: Remote access will not be available for this last-minute session.

Speakers | Moderators
avatar for Jeff de La Beaujardiere

Jeff de La Beaujardiere

Data Management Architect, NOAA
Jeff de La Beaujardière received his B.A. in physics in 1985 from the University of California, Berkeley, and his Ph.D. in astrophysics in 1990 from the University of Colorado, Boulder. Captivated by the Web’s potential for information distribution, in 1994 he joined the NASA... Read More →



Wednesday July 26, 2017 2:00pm - 3:30pm
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

3:30pm

Networking Break
Wednesday July 26, 2017 3:30pm - 4:00pm
Conference Lounge

4:00pm

Annotating datasets with measurement classes from the Ecosystem Ontology (ECSO)
A new measurement ontology, the Ecosystem Ontology (ECSO), is now available, and a system for annotating datasets in DataONE is nearing completion. Because ontologies include parent-child structure, synonyms and enable formal logic, the data discovery process can be streamlined when dataset metadata incorporates annotations from ontologies. The ECSO project currently focuses on measurements related to carbon cycling - such as fluxes (e.g., ecosystem exchange, NPP, respiration) and concentrations (of pigments, carbon compounds). Measurement descriptions include units, applicable observed entities (both terrestrial and aquatic), and methods, and are based on a broad corpus of datasets found in DataONE. Additions are planned to add other classes of measurements to ECSO. During this workshop, we will explore the ECSO ontology and demonstrate the annotation system. Participants are invinted to bring their own EML-described datasets for advice on annotating them with the ECSO ontology.

Speakers | Moderators
MO

Margaret O'Brien

Data Manager, Marine Science Institute, UCSB


Wednesday July 26, 2017 4:00pm - 5:30pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

4:00pm

Research to Operations Transition
Many excellent research projects have failed to make the transition into an operational capability. This session will bring together operations managers and developers in the area of tropical storms, hurricanes and other cyclonic activity and discuss experiences in bring research into the world of operational forecasting and disaster response. The outcome of this session will be a list of lessons learned and an assessment of what worked and why other things did not.


Wednesday July 26, 2017 4:00pm - 5:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

4:00pm

Science Gateways for Earth Science
Science gateways are user environments and science-centric middleware that broaden and deepen access to scientific computing, data, and other resources to support scientific research. Science gateways enable users to more easily use scientific applications on diverse high performance and cloud computing resources, to discover and analyze data, to reproduce computational experiments, and to share and discuss tools and results with colleagues.
Many successful science gateways have been built, but there have also been many failures, and even successful gateways must find ways to sustain themselves. To solve these problems, the National Science Foundation has funded the Science Gateways Community Institute (SGCI) as one of its two initial software sustainability institutes. The SGCI’s mission is to increase the number of successful science gateways, and to help those gateways sustain and grow their user communities. To accomplish this mission, the SGCI provides services for science gateway developers. Services include incubation support, which helps gateway developers to develop a business startup mentality towards their projects; extended developer support, which provides projects with embedded technical assistance; the scientific software collaborative, which enables the community to publish and find software that can be used for building gateways; community engagement and exchange, which helps build the overall science gateway community; and workforce development, which helps increase the pipeline of student science gateway developers.
The goal of this session will be to increase the awareness of the potential for science gateways in the ESIP community. The session will feature a presentations on current earth science gateways, a presentation on earth science and general purpose science gateway technologies that can be used as science gateway platforms, a presentation on NSF-funded SGCI services that can be used to develop gateways, and a discussion of the general and specific requirements of the earth science community.


Wednesday July 26, 2017 4:00pm - 5:30pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

4:00pm

Integrating RMap with the Open Science Framework: New tools to build and navigate your map of scholarly research products and collaborators
Today's scholarly works can be dynamic, distributed, and complex. They can consist of multiple related components (article, dataset, software, webpage, multimedia, etc.) that are made available asynchronously, assigned a range of identifiers, and stored in different repositories. Each work may involve various funders, institutions, and contributors in a range of roles.

Using semantic web technologies, the RMap platform is designed to capture and preserve maps of relationships amongst scholarly works, and make the connections discoverable to relevant user communities. Through RMap, researchers can express the full complexity of a work, and navigate the network that forms both visually and through an API.

The RMap team is currently working with the Center for Open Science to build features that makes RMap functionality available through the Open Science Framework (OSF) platform. OSF is a free, open source web application that provides project management support for researchers across the entire research life-cycle. Researchers can use the platform to collaborate on projects, manage files, and share research with the public. One OSF-RMap feature will provide the option to create and display relationships between OSF projects and other related materials that are stored outside of OSF. Another will allow OSF users to express the roles they played in different projects as RMap relationships. A third will support the display of a users’ most frequent collaborators.

In this session, we will provide an introduction to both RMap and OSF.  There will then be an interactive walk-through of the proposed features.  Attendees will be invited to follow along and try out the RMap visualization tool. The RMap team have partnered with the ESIP Usability Cluster to harness meaningful feedback provided through the expertise of the ESIP community members regarding the proposed features. Participants will get the unique opportunity to preview the proposed features and contribute feedback that we will use to improve this work as it moves into production.

The RMap project was funded by the Alfred P. Sloan Foundation and carried out as a partnership between Data Conservancy, Portico, and IEEE.

Speakers | Moderators
avatar for Reid Boehm

Reid Boehm

Data Management Consultant, JHU Data Management Services
avatar for Karen Hanson

Karen Hanson

JHU, Data Conservancy
SH

Sophie Hou

Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.


Wednesday July 26, 2017 4:00pm - 5:30pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

4:00pm

Geoscience in the Cloud: Data-Proximate Science to Address Big Data and Open Science Challenges

The advent of cloud computing has opened new avenues for not only addressing both big data and Open Science challenges to accelerate scientific discoveries. However, to successfully leverage the enormous potential of cloud technologies, it will require the data providers and the scientific communities to develop new paradigms to enable next-generation workflows and transform the conduct of science.

In this session, speakers will present ongoing work to facilitate a new paradigm for doing science by offering a suite of tools, resources, and platforms to leverage cloud technologies for addressing both big data and Open Science/reproducibility challenges, and engage in a community dialog on the opportunities and challenges of the new paradigm.

4:00-4:20 “Data Proximate Computing in the Cloud”,  Mohan Ramamurthy (Unidata Program Center)

Rapid advances in computing, communications, and observational technologies — along with concomitant advances in high-resolution modeling, ensemble and coupled-systems predictions of the Earth system — are revolutionizing nearly every aspect of the geosciences. Modern data volumes from high-resolution ensemble prediction/projection/simulation systems and next-generation remote-sensing systems like hyper-spectral satellite sensors and phased-array radars are staggering. In this talk we will discuss strategies, tools and techniques Unidata is taking to address these issues.

4:20-4:40 “Jetstream Cloud”,  Jeremy Fischer (Indiana University)

NSF’s Jetstream Cloud is a Infrastructure-as-a-Service platform comprised of two geographically isolated OpenStack+Ceph clusters, each supporting hundreds of virtual machines and data volumes. The two cloud systems are integrated via a user-friendly web application that provides a user interface for common cloud computing operations, authentication to XSEDE via Globus, and an expressive set of web service APIs. Jetstream enables on-demand access to interactive, user-configurable computing and analysis capability. It also seeks to democratize access to cloud capabilities and promote sharable, reproducible research. This talk will describe Jetstream in greater detail, as well as how its unique combination of hardware, software, and user engagement support the "long tail of science".

4:40-5:00 “Web APIs in the Cloud”, James Gallagher (OPeNDAP)

The OPeNDAP and HDF groups have been working some experiments in cloud-based data servers for NASA. Our results to-date include cost- and performance-modeling for three distinct architectures serving two test datasets (large collections of AIRES and MERRA data). Our study exclusively used AWS EC2 and S3.

5:00-5:20 “Met/Ocean Modeling Workflows on XSEDE via HPC & Cloud”, Rich Signell (USGS) and Julien Chastang (Unidata Program Center)

Met/ocean modeling workflows have technical challenges: compute power with fast connections for producing simulations, large storage space for output, powerful tools for analysis and visualization, and specialized services for effectively distributing data.  To address this issues, we have developed a sample end-to-end workflow solution for generating and handling simulation data on NSF XSEDE HPC and cloud computing resources.  Simulations are conducted on HPC systems (e.g. “Bridges”, “Stampede”) and then transferred to the Jetstream Cloud via Globus GridFTP. Model output can then be analyzed and visualized using JupyterHub, keeping computations close to the data, and taking advantage of custom Python environments that are preinstalled.  Data can also be shared with colleagues outside XSEDE using a THREDDS Data Server running on Jetstream.  The components are packaged as Docker containers for deployment on other systems.

5:20-5:30 “Next Steps and Challenges” (Speakers and Attendees)


Speakers | Moderators
avatar for Julien Chastang

Julien Chastang

Software Engineer, UCAR - Unidata
Scientific software developer at UCAR-Unidata.
avatar for Rich Signell

Rich Signell

Oceanographer, USGS Coastal and Marine Geology Program
Ocean Modeling, Python, NetCDF, THREDDS, ERDDAP, UGRID, SGRID, CF-Conventions, Jupyter, JupyterHub, CSW, TerriaJS


Wednesday July 26, 2017 4:00pm - 5:30pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

4:00pm

The Institutional Research Data Lifecycle – A Researcher’s Gauge of Frustration
Institutions are integral in the support of the scientific process. They establish the policy and procedure for researchers to follow ensuring quality and integrity across the institution. They bear the risk and impact when research does not stand up under scrutiny, or data are improperly handled or used in research.

Institutions are challenged with meeting the needs of a researcher while at the same time managing the infrastructure and polices that protect the institution’s integrity.

Examples of Researcher Frustration and Institutional Risk:
• A researcher with a 6-month grant where the institutional steps to download the needed data were not completed until the 7th month.
• A researcher who needs a dataset with compliance and regulations requirements that add significant risk to the institution cyber-infrastructure resulting in not accepting a grant because the capacity to comply was not in place.

In this session, we invite the institution community and the greater stakeholder community to discuss the research data lifecycle, how institutions are helping researchers with data challenges, and the numerous compliance requirements that are levied by institutions on researchers, and how we can make a more streamline process for the scientific process.

Session Notes

Speakers | Moderators
avatar for Karl Benedict

Karl Benedict

Director of Research Data Services, University of New Mexico
For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and curren... Read More →
avatar for Shelley Stall

Shelley Stall

Director, Data Programs, AGU
Ask me about: | AGU's Data Management Assessment Program | FAIR Data | AGU's Data Fair


Wednesday July 26, 2017 4:00pm - 5:30pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

7:00pm

Research as Art
Once again at this year’s ESIP Summer Meeting we'll hold a Research as Art event on the evening of Wednesday, July 26. Our goal is to encourage the ESIP community to use visual media to communicate their data and research; and to think about their research as an ongoing narrative that can be told through visual media. This event is about showing how the ESIP community uses data. The idea is to have a range of entries that show the diversity of research done by members of our community, as well as their creativity and the impact of their work, in an engaging and accessible way. 

Sponsors
avatar for ESRI

ESRI

ESRI


Wednesday July 26, 2017 7:00pm - 9:00pm
WonderLab 308 W 4th St, Bloomington, IN 47404
 
Thursday, July 27
 

8:00am

Registration Open
Thursday July 27, 2017 8:00am - 9:00am
East Lounge

8:30am

Educators Workshop: Drones, Data and The Great American Eclipse
Fifteen to twenty regional educators will join the ESIP education community at this exciting 1-day workshop. Participating teachers will receive a recreational drone, learn about several data-related activities within the ESIP community, and the August 2017 Eclipse. This full day workshop will take place on Thursday July 27th, the agenda follows:

8:30- Welcome, Introductions, ESIP Overview - Shelley Olds, ESIP Education Chair
9:00- NOAA Data in the Classroom - June Teisan
10:00- Using Mapping Tools to work with Natural Hazards Data- Robert Downs
10:30 - break
11:00- Getting the Most out of GOES-16 Data - Margaret Mooney
11:30- The Great American Eclipse* - Pat Reiff
12:00- Eclipse Outdoor Activities
12:30 - LUNCH
1:30 - Drone workshop - Flight basics - Shelley Olds & LuAnn Dahlman
2:00 - Practicing controlled flight
2:30 - Taking photos and videos with your drone
3:00 - Try out an activity
3:30 - Break
4:00 - About sensors
4:30 - Develop action plans
5:00 - Evaluation and wrap-up
5:30 - Adjourn

Workshop Materials
http://wiki.esipfed.org/index.php/Education/2017edworkshop

*everyone is welcome

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Shelley Olds

Shelley Olds

Science Education Specialist, UNAVCO
Data visualization tools, drones and STEM, 3-D imaging (structure from motion), human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


Thursday July 27, 2017 8:30am - 10:30am
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

9:00am

New platforms and techniques for strengthening ties between observations and user communities
Over the past couple of years research data stewardship platforms and tools have grown substantially. Yet when we say "grown" it may not be quite in the way one might think. Instead of growing larger as a unit, a measurement of growth here is in a systems ability to connect and the perception of relevance and utility of that system to the user communities. In this session we propose to introduce several platforms and initiatives within the research data management realm that exhibit this enhanced capacity to connect with the user on their terms. These platforms are unique in their missions yet they also connect with one another in ways that advance the overall utility for the user.  We would like to get a larger picture of the current work of several of these entities, their common threads, and the specific projects with which they are engaged.
We are honored to have the following speakers present:
  • Josh Nicholson with Authorea - a document editor that allows researchers to write, cite, collaborate, host data and publish content online.
  • Laurel Haak with OrcID - a nonprofit organization whose service provides a unique identifier for individuals to use with their name as they engage in research, scholarship, and innovation activities.
  • John Kunze with EZID - a service conceived, built, and maintained by the California Digital Library to create and manage globally unique IDs for data and other sources.
  • Amy Krichhoff with Portico - a large, international digital preservation service designed by the non-profit group ITHAKA to facilitate the academic community's use of digital technologies and sustainable preservation of the scholarly record.
  • Karen Hanson with RMap - a project to develop mechanisms to preserve the many-to-many complex relationships among scholarly publications and their underlying data, thereby supporting the continual development of scholarly communication and digital publishing. (This will be an update from the winter session.)
  • Mercè Crosas with Dataverse - an open source project developed at Harvard, Dataverse is a research repository software package that allows its users to share, preserve, cite, explore, and analyze research data via an online web application.
By leveraging the ways these entities work together in the user's favor, this in turn helps to solve the issue of too many disparate and unsustainable singular operations. As Simon Porter (2016) notes, system boundaries are disappearing and in its place emerges a "new research information citizenship" - one based on a network of contributors not confined to one specific system. Looking at these groups we can begin to see this emerging. The platforms and their utilities are of interest to ESIP members on several different levels. They represent some of the latest work in user-centered research data platforms and they also focus on openness, transparency, and ways to foster connections among different stakeholders and data user groups. In turn ESIP provides a unique venue for these platforms to explore new ways to work with the communities most engaged in working with multiple types of earth science data. 
The session will conclude with a discussion of gaps and the path forward.
Science, Digital; Porter, Simon (2016): Digital Science White Paper: A New ‘Research Data Mechanics’. figshare.https://dx.doi.org/10.6084/m9.figshare.3514859.v1 Retrieved: 16 56, Oct 27, 2016 (GMT)

Speakers | Moderators
avatar for Reid Boehm

Reid Boehm

Data Management Consultant, JHU Data Management Services
avatar for Ruth Duerr

Ruth Duerr

data; identifiers; citations; discovery and access
avatar for Karen Hanson

Karen Hanson

JHU, Data Conservancy


Thursday July 27, 2017 9:00am - 10:30am
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

9:00am

Modeling & Forecasting
Modeling and forecasting capabilities have evolved considerably over the past 10 years. Hydrology, atmospheric chemistry, physical oceanography and weather have all experienced substantial improvements, but more are on the way. In addition, the tools upon which these models run are improving rapidly, including the supercomputing capabilities and the related analysis tools. A panel will describe modern advances and then open the floor for discussion. This session will identify new techniques and technologies that can be leveraged from the current advances to improve the speed and accuracy of these models. The output of this session will be a Paper summarizing the impact of the emerging capabilities and what can be done with them.


Thursday July 27, 2017 9:00am - 10:30am
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

9:00am

Lowering Barriers to Metadata Curation
Metadata creation and curation is often confusing for those outside of the digital repository community. Many data providers wish to provide accurate and helpful information for metadata but are often overwhelmed by the various metadata standards and the jargon associated with many metadata concepts. This session explores ways to lower these barriers to metadata creation and curation.

Speakers | Moderators
avatar for Sean Gordon

Sean Gordon

Metadata Developer, The HDF Group
I co-chair the Documentation Cluster, I'd love to hear from you what directions you'd like to see the cluster headed in the coming year. | I'm excited to run a Metadata Improvement Lab at ESIP that will teach participants to initiate a toolkit for evaluating metadata in any di... Read More →


Thursday July 27, 2017 9:00am - 10:30am
3 - Georgian 900 E 7th St, Bloomington, IN 47405

9:00am

Come tell us your ESIP Story!
Come in and tell us about your experience with ESIP. We will be taking video and audio to capture ESIP stories and outcomes over the past, nearly, 20 years. 

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP


Thursday July 27, 2017 9:00am - 10:30am
Hoosier

9:00am

Free and Open Source Software (FOSS) and Technologies for the Cloud
There are many freely available software and technologies that can improve DevOps for the Cloud. We invite presentations on the applications and lesson-learned on these solutions. We invite presentations on container technology, orchestration and automation, container and virtual machine management, data analytic, machine learning, data management, etc.

Current list of invited speakers
  • Nga Quach - JPL - Docker Container Deployment
  • Joe Jacob - JPL - Big Data Analytics using Apache Spark
  • Phil Yang - GMU - Building private cloud environment using Eucalyptus and OpenSpack
  • John Readey - HDF Group - HDF Data Service
  • Banjamin Banavige and Karthik Garimella - GSFC - Jupyter Notebook Recipes

Speakers | Moderators

Thursday July 27, 2017 9:00am - 10:30am
4 - Maple 900 E 7th St, Bloomington, IN 47405

9:00am

Implementing OGC-Compliant Sensor Networks
Standards are the corner-stone of interoperability. As we begin integration of sensors directly into a network of data, providing fully-described metadata becomes a requirement to enable assessment and assurance of data quality. This workshop provides hands-on introduction to tools and best-practices that enable development of resources that can operate within the standards-based, community-adopted frameworks of the Open-Geospatial Consortium and the W3C Semantic Web.

Speakers | Moderators
JF

Janet Fredericks

Woods Hole Oceanographic Institution
avatar for Carlos Rueda

Carlos Rueda

Software Engineer, MBARI
My areas of expertise and interest include scientific data management, visualization, data integration and interoperability, programming languages, and semantic web. http://www.mbari.org/rueda-carlos/


Thursday July 27, 2017 9:00am - 10:30am
5 - Oak 900 E 7th St, Bloomington, IN 47405

9:00am

Sustainable Data Management, Return on Investment
The Sustainable Data Management Cluster draws on the collective experience of those managing data and repositories to promote collaboration and curation strategies that benefit both research networks and individual investigators. A major activity of this group is to develop a framework for calculating Return on Investment (RoI) in data repositories. Recent activities include targeted workshops (with NSF funding) and ESIP meeting sessions. 

Defining RoI in the research data repository setting is challenging, but in normal business terms calculating RoI provides an assessment of the efficiency of an investment by performing a comparison of the amount invested versus the gain resulting from the investment. It is calculated by dividing the measured benefit by the cost of the investment and is normally expressed as a ratio or percentage:
ROI = (Calculated gain from investment - Cost of investment)/Cost of investment

In this session, we will continue the examination of current approaches to evaluating the RoI of research data repositories with an expert in that field, Neil Beagrie (remotely), with data that pertain to our own repositories. Our goals are a white paper outlining proposed approach(es) for assessing the ROI of existing repositories with suggested best practices and defined metrics.

Speakers | Moderators
MO

Margaret O'Brien

Data Manager, Marine Science Institute, UCSB


Thursday July 27, 2017 9:00am - 10:30am
6 - Walnut 900 E 7th St, Bloomington, IN 47405

10:30am

Networking Break
Thursday July 27, 2017 10:30am - 11:00am
Conference Lounge

11:00am

Users, meet your observations. Observations, meet your users. Mediation: a role for cognitive computing?

Cognitive computing and artificial intelligence have seen significant advances in recent years — from Watson to Alexa. At the same time, data stewardship remains constrained by the human effort needed to interpret and describe the context and implicit knowledge necessary to enable effective data reuse, especially across disciplines. At its heart, data curation is an act of mediation — the act of representing and presenting complexity and perspective across difference. Over time, mediation has become increasingly complex in information science. And while this mediation is increasingly reliant on technologies such as the semantic web and automated processes, it is still overly reliant on human interpretation to explain everything from a missing value indicator (-999, -, “”, n/a, …) to complex choices around the statistical methods  applied. If curation and mediation are going to scale to “Internet of Things” levels of complexity and volume they will need to be much more automated. This will require both improved technologies and a better understanding of how researchers think and make choices.

In this session we seek to have a community discussion on whether there is a role for cognitive computing in reducing the curatorial bottleneck of data reuse and what ESIP might do to facilitate that work. Fox and Parsons will introduce the topic, but we then expect a ranging and engaged discussion. Come with ideas, insights, and provocations.


Speakers | Moderators
avatar for Peter Fox

Peter Fox

Constellation Chair, Tetherless World Constellation, RPI



Thursday July 27, 2017 11:00am - 12:30pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

11:00am

Advancing netCDF-CF
Update on netCDF-CF activities. Discuss possible ESIP activities and next steps.

Speakers | Moderators

Thursday July 27, 2017 11:00am - 12:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

11:00am

Metadata Improvement Lab: Leveraging JuPYter Notebook to Evaluate Metadata
The Second MILE (Metatdata Improvement Lab at ESIP) allows the user to select XML collections and zip them up to send to a web service that evaluates the records for conceptual location and content. We'll focus on creating the data needed for a Recommendations Analysis Dashboard, and if we have the time move on to other metadata evaluation that can help curators.

No coding is needed to participate, but you may learn some. The lab utilizes Bash and Python with the default packages contained in Anaconda with Python 3.6.

Download the graphical installer for the application before the Lab:  https://www.continuum.io/downloads 

The notebooks and metadata sample sets are located in a Github Repository. https://github.com/scgordon/MILE2

Flash drives will be available for those that did not get a chance to download beforehand if the room's connection.

Speakers | Moderators
avatar for Sean Gordon

Sean Gordon

Metadata Developer, The HDF Group
I co-chair the Documentation Cluster, I'd love to hear from you what directions you'd like to see the cluster headed in the coming year. | I'm excited to run a Metadata Improvement Lab at ESIP that will teach participants to initiate a toolkit for evaluating metadata in any di... Read More →


Thursday July 27, 2017 11:00am - 12:30pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

11:00am

Come tell us your ESIP Story!
Come in and tell us about your experience with ESIP. We will be taking video and audio to capture ESIP stories and outcomes over the past, nearly, 20 years. 

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP


Thursday July 27, 2017 11:00am - 12:30pm
Hoosier

11:00am

Cloud Onboarding
Are you thinking of moving your existing applications to the cloud? Are you designing your next project to use the cloud? There are just too many unknowns when we consider cloud computing for our projects. This session focuses on migrating existing technologies and services to the cloud, and new projects wanting to fully leverage the cloud environment. We invite presentations on lessons learned, software and architectural approaches (e.g. 12-factor), and cost saving strategies. In addition to oral presentations, we encourage poster and demos at during the ESIP poster session.

Current list of invited speakers:
  • Jeff de La Beaujardiere - NOAA on behalf of USGEO DMWG - Presentation of draft USGEO Cloud Computing Recommendations
  • Dan Pilone / Doug Newman - EOSDIS/EED2 - Cloud Onboarding with NGAP
  • Hook Hua - JPL - OCO2 and Sentinel-1A/B analysis onboarding
  • Paul Pilone - E84 - Serverless Data Querying with Amaon Athena

Speakers | Moderators
avatar for Jeff de La Beaujardiere

Jeff de La Beaujardiere

Data Management Architect, NOAA
Jeff de La Beaujardière received his B.A. in physics in 1985 from the University of California, Berkeley, and his Ph.D. in astrophysics in 1990 from the University of Colorado, Boulder. Captivated by the Web’s potential for information distribution, in 1994 he joined the NASA... Read More →
avatar for Thomas Huang

Thomas Huang

Data Scientist, JPL
avatar for Dan Pilone

Dan Pilone

Technical Lead, Element 84, Inc.



Thursday July 27, 2017 11:00am - 12:30pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

11:00am

Evaluating the Options and Forging a Roadmap for Becoming a Trusted Repository
Enabling transparent accountability and establishing trust with its communities are crucial to a data repository because having these characteristics can have significant impact on the repository’s current services as well as long term value. One method for a data repository to demonstrate its commitment and progress towards building its trustworthiness is through gaining an official recognition. However, since there are several standards or certification processes available, it might not always be clear how and why a data repository should follow a particular standard or pursue a specific certificate versus another.

During this session, the session speakers will present their experiences with reviewing and selecting from the various options for formalizing trustworthiness of their repositories, including ISO 16363, nestor Seal, and Data Seal of Approval. The discussion will also encompass the pros and cons, as well as overall value, of undertaking such a certification process.

The session agenda is the following:
  1. Introduction – Matt
  2. “Navigating the Trusted Repository Requirements: A Beginner’s Perspective” – Sophie (NCAR) 
  3. “Adopting World Data System Certification to Improve a Trustworthy Data Repository” - Bob (SEDAC) 
  4. “Certifying Trusted Digital Repositories: USGS Use Case” -John (USGS) 
  5. “NASA’s EOSDIS – Trust and Certification” - Rama (SSAI/NASA ESDIS)
  6. “Keep Up the Good Work: Maintaining Trust Between Certification Assessments” - Shelley (AGU) 
  7. Q & A 

Speakers | Moderators
avatar for John Faundeen

John Faundeen

Archivist, U.S. Geological Survey (USGS)
Education in Geography combined with History minor. Love time and space elements. Very involved in appraisal and preservation activities.
SH

Sophie Hou

Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
MM

Matthew Mayernik

Project Scientist / Research Data Services Specialist, National Center for Atmospheric Research (NCAR)
avatar for H. K. “Rama” Ramapriyan

H. K. “Rama” Ramapriyan

Research Scientist, Subject Matter Expert, Science Systems and Applications, Inc.
Member, ESIP Data Stewardship Committee; Chair, ESIP Information Quality Cluster; Co-Chair, NASA ESDSWG Data Quality Cluster; NASA ESDIS Project (Assistant Project Manager, Retired).
avatar for Shelley Stall

Shelley Stall

Director, Data Programs, AGU
Ask me about: | AGU's Data Management Assessment Program | FAIR Data | AGU's Data Fair


Thursday July 27, 2017 11:00am - 12:30pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

11:00am

Connecting Communities Part 2: Maximizing value from our multiple memberships
Many ESIP meeting attendees are members of several other informatics-related organizations, for example, AGU Earth and Space Science Informatics (ESSI) Focus Group, Geological Society of America Geoinformatics Division, NOAA Environmental Data Management (EDM), NASA Earth Science Data System Working Groups (ESDSWG), USGS Community for Data Integration (CDI), Research Data Alliance (RDA), EarthCube, and Force11, each which host their own annual or biannual events. At the ESIP 2016 Summer meeting, participants at a working session identified actions that could reduce silo-ing and duplication of effort in these different organizations. One idea was to have leaders and members of multiple organizations meet once a year to discuss ways to coordinate and leverage their events and resources.

In this working session, we will build on last year’s work by discussing the upcoming schedule of meetings and events, identifying widely-applicable outputs to share (such as best practice and guidance documents), and setting future check-in dates to review progress and improvements.

Speakers | Moderators
avatar for Leslie Hsu

Leslie Hsu

Coordinator, Community for Data Integration, US Geological Survey
Talk to me about Communities of Practice, USGS data, collaboration tools, geomorphology, food, basketball.
avatar for Shelley Stall

Shelley Stall

Director, Data Programs, AGU
Ask me about: | AGU's Data Management Assessment Program | FAIR Data | AGU's Data Fair


Thursday July 27, 2017 11:00am - 12:30pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

11:00am

Educators Workshop: Drones, Data and The Great American Eclipse
Fifteen regional educators will join the ESIP education community at this exciting 1-day workshop. Participating teachers will receive a recreational drone, learn about several data-related activities within the ESIP community, and the August 2017 Eclipse. This full day workshop will take place on Thursday July 27th, the agenda follows:

8:30- Welcome, Introductions, ESIP Overview - Shelley Olds, ESIP Education Chair
9:00- NOAA Data in the Classroom - June Teisan
10:00- Using Mapping Tools to work with Natural Hazards Data- Robert Downs
10:30 - break
11:00- Getting the Most out of GOES-16 Data - Margaret Mooney
11:30- The Great American Eclipse* - Pat Reiff
12:00- Eclipse Outdoor Activities
12:30- LUNCH
1:30 - Drone workshop - Flight basics - Shelley Olds & LuAnn Dahlman
2:00 - Practicing controlled flight
2:30 - Taking photos and videos with your drone
3:00 - Try out an activity
3:30 - Break
4:00 - About sensors
4:30 - Develop action plans
5:00 - Evaluation and wrap-up
5:30 - Adjourn

Workshop Materials

http://wiki.esipfed.org/index.php/Education/2017edworkshop

*everyone is welcome

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Shelley Olds

Shelley Olds

Science Education Specialist, UNAVCO
Data visualization tools, drones and STEM, 3-D imaging (structure from motion), human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


Thursday July 27, 2017 11:00am - 12:30pm
7 - Whittenberger Auditorium 900 E 7th St, Bloomington, IN 47405

12:30pm

Lunch | Unconference Intro
Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP
avatar for Steve Diggs

Steve Diggs

Project Manager, Scripps Institution of Oceanography


Thursday July 27, 2017 12:30pm - 2:00pm
Alumni Hall

2:00pm

Unconference

Details:

  • Sessions are 45 mins; We will run 6 tracks 4x (2 before break/2 after)

  • Submit session ideas on large post-its at front desk starting now - Thurs at lunch!

  • Session submitters will have 30 sec to pitch ideas at lunch Thurs

  • Vote on sessions with dots in your badge

  • Agenda will form at lunch on Thursday

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP
avatar for Steve Diggs

Steve Diggs

Project Manager, Scripps Institution of Oceanography


Thursday July 27, 2017 2:00pm - 2:45pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

2:00pm

Educators Workshop: Drones, Data and The Great American Eclipse
Fifteen to twenty regional educators will join the ESIP education community at this exciting 1-day workshop. Participating teachers will receive a recreational drone, learn about several data-related activities within the ESIP community, and the August 2017 Eclipse. This full day workshop will take place on Thursday July 27th, the agenda follows:

8:30- Welcome, Introductions, ESIP Overview - Shelley Olds, ESIP Education Chair
9:00- NOAA Data in the Classroom - June Teisan
10:00- Using Mapping Tools to work with Natural Hazards Data- Robert Downs
10:30 - break
11:00- Getting the Most out of GOES-16 Data - Margaret Mooney
11:30- The Great American Eclipse* - Pat Reiff
12:00- Eclipse Outdoor Activities
12:30 - LUNCH
1:30 - Drone workshop - Flight basics - Shelley Olds & LuAnn Dahlman
2:00 - Practicing controlled flight
2:30 - Taking photos and videos with your drone
3:00 - Try out an activity
3:30 - Break
4:00 - About sensors
4:30 - Develop action plans
5:00 - Evaluation and wrap-up
5:30 - Adjourn

Workshop Materials
http://wiki.esipfed.org/index.php/Education/2017edworkshop

*everyone is welcome

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Shelley Olds

Shelley Olds

Science Education Specialist, UNAVCO
Data visualization tools, drones and STEM, 3-D imaging (structure from motion), human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


Thursday July 27, 2017 2:00pm - 3:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

2:00pm

Come tell us your ESIP Story!
Come in and tell us about your experience with ESIP. We will be taking video and audio to capture ESIP stories and outcomes over the past, nearly, 20 years. 

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP


Thursday July 27, 2017 2:00pm - 3:30pm
Hoosier

2:45pm

Unconference

Details:

  • Sessions are 45 mins; We will run 6 tracks 4x (2 before break/2 after)

  • Submit session ideas on large post-its at front desk starting now - Thurs at lunch!

  • Session submitters will have 30 sec to pitch ideas at lunch Thurs

  • Vote on sessions with dots in your badge

  • Agenda will form at lunch on Thursday

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP
avatar for Steve Diggs

Steve Diggs

Project Manager, Scripps Institution of Oceanography


Thursday July 27, 2017 2:45pm - 3:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

3:30pm

Networking Break
Thursday July 27, 2017 3:30pm - 4:00pm
Conference Lounge

4:00pm

Unconference

Details:

  • Sessions are 45 mins; We will run 6 tracks 4x (2 before break/2 after)

  • Submit session ideas on large post-its at front desk starting now - Thurs at lunch!

  • Session submitters will have 30 sec to pitch ideas at lunch Thurs

  • Vote on sessions with dots in your badge

  • Agenda will form at lunch on Thursday

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP
avatar for Steve Diggs

Steve Diggs

Project Manager, Scripps Institution of Oceanography


Thursday July 27, 2017 4:00pm - 4:45pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

4:00pm

Educators Workshop: Drones, Data and The Great American Eclipse
Fifteen to twenty regional educators will join the ESIP education community at this exciting 1-day workshop. Participating teachers will receive a recreational drone, learn about several data-related activities within the ESIP community, and the August 2017 Eclipse. This full day workshop will take place on Thursday July 27th, the agenda follows:

8:30- Welcome, Introductions, ESIP Overview - Shelley Olds, ESIP Education Chair
9:00- NOAA Data in the Classroom - June Teisan
10:00- Using Mapping Tools to work with Natural Hazards Data- Robert Downs
10:30 - break
11:00- Getting the Most out of GOES-16 Data - Margaret Mooney
11:30- The Great American Eclipse* - Pat Reiff
12:00- Eclipse Outdoor Activities
12:30 - LUNCH
1:30 - Drone workshop - Flight basics - Shelley Olds & LuAnn Dahlman
2:00 - Practicing controlled flight
2:30 - Taking photos and videos with your drone
3:00 - Try out an activity
3:30 - Break
4:00 - About sensors
4:30 - Develop action plans
5:00 - Evaluation and wrap-up
5:30 - Adjourn


Workshop Materials
http://wiki.esipfed.org/index.php/Education/2017edworkshop

*everyone is welcome

Speakers | Moderators
avatar for LuAnn Dahlman

LuAnn Dahlman

Science Writer and Editor, NOAA Climate Program Office
avatar for Shelley Olds

Shelley Olds

Science Education Specialist, UNAVCO
Data visualization tools, drones and STEM, 3-D imaging (structure from motion), human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


Thursday July 27, 2017 4:00pm - 5:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

4:00pm

Come tell us your ESIP Story!
Come in and tell us about your experience with ESIP. We will be taking video and audio to capture ESIP stories and outcomes over the past, nearly, 20 years. 

Contest 1: Raffle at the ESIP Summer Meeting: Be there to win!

Rules:

1) You MUST add your story to the ESIP Slack channel: #myesipstory before 5pm Thursday, July 27, 2017. NOTE: every person who adds a story to Slack gets ONE chance (adding more stories is great, but will not improve your odds). WIN A DJI PHANTOM 3 DRONE!

If you are not already on the ESIP Slack team FIND an Ambassador and use her QR code for your invitation to Slack!

2) You must be present at the 7-9 PM Thursday, July 27, 2017 ESIP FUNding Friday Drawathon at The Tap in Bloomington! If your name is called and you are not there, we will draw another name!

Contest 2: Best Story Wins. You do not need to be at the Summer Meeting for this contest.

Rules:

1) You CAN add your story to the ESIP Slack channel: #myesipstory before 5pm Thursday, July 27, 2017.…

OR, you can TELL your story at the ESIP Story room video event all day Thursday, July 27, 2017 (See the Sched for details). [NOTE: if you tell your story and want to be included in the Contest 1 raffle, you still need to hop on Slack and enter your story.]  WIN A DJI PHANTOM 3 DRONE!

2) You do NOT NEED TO BE PRESENT to win. The ESIP Board will pick a selection team who will determine the best story. HINT: the best story will a story that describes how being an ESIP member has benefited you and/or your organization. Tell not what you did for ESIP, but rather, what ESIP did for you!

3) The Prize will be determined by August 15, 2017, and the Drone will be mailed to the winner.




Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP


Thursday July 27, 2017 4:00pm - 5:30pm
Hoosier

4:45pm

Unconference

Details:

  • Sessions are 45 mins; We will run 6 tracks 4x (2 before break/2 after)

  • Submit session ideas on large post-its at front desk starting now - Thurs at lunch!

  • Session submitters will have 30 sec to pitch ideas at lunch Thurs

  • Vote on sessions with dots in your badge

  • Agenda will form at lunch on Thursday

Speakers | Moderators
avatar for Bruce Caron

Bruce Caron

Community Architect, ESIP
avatar for Steve Diggs

Steve Diggs

Project Manager, Scripps Institution of Oceanography


Thursday July 27, 2017 4:45pm - 5:30pm
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

7:00pm

 
Friday, July 28
 

8:00am

Registration Open
Friday July 28, 2017 8:00am - 9:00am
East Lounge

8:30am

FUNding Friday Pitch-it
Friday July 28, 2017 8:30am - 9:00am
2 - Frangipani 900 E 7th St, Bloomington, IN 47405

9:30am

What Do We Mean by “Trusted” Data?
This interactive breakout session will build on initial work in the Disasters Lifecycle Cluster addressing Trusted Data concepts (derived from www.tdwi.org). The goal is to develop a working definition of Trusted Data for data sets supporting disasters-related activities, however clarifying the characteristics and features will be relevant to other ESIP data domains. In recent work with the All Hazards Consortium on Data Driven Decision Making, our 3DM workshops, we see a need for a more precise definition of Trusted Data as we discover, test and validate data sets for operational use. We plan to leverage relevant lessons learned from disaster preparedness exercises such as Cascadia Rising, and experience with socio-economic data at CIESIN. We will also review complementary work on Operational Readiness Levels (ORLs) for maturing data sets through the ESIP Geo-Collaborate Testbed, with an end goal of validating or certifying resulting data sets as Trusted.

Speakers | Moderators
avatar for Karen Moe

Karen Moe

Emeritus, NASA ESTO
Co-chair the Disasters Lifecycle Cluster, ESIP Board Member at Large


Friday July 28, 2017 9:30am - 11:00am
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

9:30am

Cross-Institutional Curation and Servicing of Virtual Collections – from Prototype to Operations
A number of data centers have explored various approaches to permit the integrated discovery, access and analysis of Earth science data from different observational platforms and model outputs. The goal of this session is to evaluate proofs-of-concept and prototypes accomplished to date, with the intent identifying a set of requirements for implementing an operational capability to curate such collections across different data centers. These recommendations will focus on virtual collections implemented through NASA's Common Metadata Repository.

Speakers | Moderators
avatar for Amanda Leon

Amanda Leon

DAAC Manager, NASA National Snow and Ice Data Center DAAC



Friday July 28, 2017 9:30am - 11:00am
3 - Georgian 900 E 7th St, Bloomington, IN 47405

9:30am

ESIP Lab Informational Session and Project Report Out
The three funded projects from the Fall 2016 ESIP Lab RFP will report out on their projects. Lear about 1) An Automatic Approach to Building Earth Science Knowledge Graph to Improve Data Discovery 2) Prototyping a Cloud-based High Performance Spatial Web Portal for Parallel Analytics of Big Climate Data, and 3) Cloud-based computing for predicting streamflow in snowmelt dominated watersheds.

Speakers | Moderators
avatar for Annie Burgess

Annie Burgess

ESIP Lab Director, ESIP
avatar for Yongyao Jiang

Yongyao Jiang

George Mason university
I’m Yongyao Jiang, ESIP Discovery cluster student fellow and Ph.D. student at George Mason University in the Spatiotemporal Innovation Center. My research primarily focuses on building an intelligent geospatial data discovery engine to strengthen the ties between Earth observat... Read More →
avatar for Lewis John McGibbney

Lewis John McGibbney

DAAC Engineer, PO.DAAC
My name is Lewis John McGibbney, I am currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California where I work in Computer Science and Data Intensive Applications. I enjoy floating up and down the tide of technologies @TheASF having a real enthusiasm... Read More →



Friday July 28, 2017 9:30am - 11:00am
4 - Maple 900 E 7th St, Bloomington, IN 47405

9:30am

New Developments in VR/AR for Science
This has been another exciting year for virtual reality (VR) and augmented reality (AR) technology and there are many individuals and institutions investigating the applicability of these technologies for science. In this breakout session, we will present some of the work that is taking place and connect it to the types of data and focus areas of interest to ESIP members. Additionally, the results of the NASA Earth Science Technology Office (ESTO) Study of VR/AR for Science and potential use cases will be presented.

Speakers | Moderators
avatar for Shayna Skolnik

Shayna Skolnik

Co-founder / CEO, Navteca
Virtual reality, data visualization, science storytelling in VR, cloud computing, entrepreneurship, NASA ESTO Discover AQ project, | creativity + technology = awesome


Friday July 28, 2017 9:30am - 11:00am
5 - Oak 900 E 7th St, Bloomington, IN 47405

9:30am

Council for Data Facilities
Speakers | Moderators
TA

Tim Ahern

Director of Data Services, IRIS
I manage a seismological data center and a broader international data system in seismology and other time series data in general.. IRIS (and seismology in general) is an exemplar in advocating for free and open access to data. I am very interested in methods of simplifying dat... Read More →


Friday July 28, 2017 9:30am - 11:00am
Persimmon

9:30am

Data Management Training Clearinghouse Roundup
Now that the DMT Clearinghouse is up and running, and following on from some user testing of the interface and functionality, we want to provide a number of opportunities for interested participants to help us move the Clearinghouse forward in a wild & woolly working session. Organizers will bring suggestions for educational resources to add to the DMT Clearinghouse, but are also seeking new sources of training materials.  You can prepare for the session ahead of time by bringing information about any educational resources that you think would be useful to researchers and data professionals. At the session, we will work with the attendees to review the suggested resources, submit those appropriate for the Clearinghouse, and allot time to look for other resources to add to our list of resources to submit.
The kind of educational resources we’re looking for could be delivered online, face to face, or using a combination of both techniques.  Examples of possible resources include: remote access tutorials, webinars, videos, audio presentations, exercises or training activities such as specific examples of data use or so-called data “recipes” that could be used to supplement other educational resources, syllabi, lesson plans, teacher guides, related tool training guides (e.g., for DMP Online), etc.  All resources should provide instruction directed to researchers and/or data professionals who wish to train themselves or others on best practices related to the management of research data including but not limited to Earth science data.  In terms of audience, generally, we are looking for resources that would offer instruction to upper level scientists of all types, undergrads / early grad students, post-docs and early career scientists, data professionals such as data managers and data librarians, and citizen scientists.  
For ideas about what to submit, check out what’s already in the DMT Clearinghouse at:  http://dmtclearinghouse.esipfed.org.   At the working session, organizers and experienced submitters will be able to answer questions, show examples, explain decisions made with respect to prior submissions, and celebrate your additions to the inventory as well as collect your feedback on your user experiences with the DMT Clearinghouse website functionality and resource submission form.  As there is a limit to the number of people who can attend remotely, if you wish to attend this session remotely, please let us know that you will be attending on the ESIP Sched app or at:   https://docs.google.com/spreadsheets/d/1TaGX6xJt-JKDtjrA-eUGTXf1i2bm-6wKEgk6W1D54PU/edit?usp=sharing.  For questions, please contact either Nancy Hoebelheinrich (nhoebel@kmotifs.com) or Sophie Hou (hou@ucar.edu).

Speakers | Moderators
avatar for Nancy Hoebelheinrich

Nancy Hoebelheinrich

Principal, Knowledge Motifs LLC
See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
SH

Sophie Hou

Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.


Friday July 28, 2017 9:30am - 11:00am
6 - Walnut 900 E 7th St, Bloomington, IN 47405

11:00am

11:30am

What’s the Right Data for this Use Case?
As part of the joint ESIP-All Hazards Consortium (AHC) Data Driven Decision Making (3DM) collaboration, the Disasters Lifecycle cluster is participating in an activity to develop use cases. The AHC group is looking to enhance disaster response decision making with better data. This breakout session will review and refine the process by which ESIP members analyze the AHC use cases to derive key information to identify relevant candidate data sets. We will look at what it will take to prepare and test data sets, and if determined to be useful, address how to make the data sets operational. We will review ongoing work on exercises, such as Cascadia Rising, as well as Operational Readiness Levels (ORLs) for maturing data sets through the ESIP GeoCollaboration Testbed. Ideally we may develop a pathfinder data set to exercise the data use strategy.

Speakers | Moderators
avatar for Karen Moe

Karen Moe

Emeritus, NASA ESTO
Co-chair the Disasters Lifecycle Cluster, ESIP Board Member at Large


Friday July 28, 2017 11:30am - 1:00pm
1 - Dogwood 900 E 7th St, Bloomington, IN 47405

11:30am

Geoscience Jargon Wiki Edit-a-thon: Updating geoscience terms in Wikipedia
Participants in the “Getting Past Jargon” session at the winter 2017 ESIP meeting identified close to 100 terms (https://goo.gl/gO31sP ) that could be considered problematic within Earth science domains. Help us bridge these gaps in understanding! Join us for a communal updating of Wikipedia entries related to these terms. We will give participants some background on how to do this, and hope that this will lead to better understanding of “Geoscience Jargon” within and outside the Earth science community.

Please bring a wifi enabled laptop to the session. We also encourage interested attendees to create a Wikipedia account (https://en.wikipedia.org/w/index.php?title=Special:CreateAccount&returnto=Main+Page ) in advance of arriving in Bloomington, as there is a limit to new editors per IP address per 24 hours.

Speakers | Moderators
avatar for Denise Hills

Denise Hills

Director, Energy Investigations, Geological Survey of Alabama
Long tail data, data preservation, connecting physical samples to digital information, geoscience policy, science communication


Friday July 28, 2017 11:30am - 1:00pm
3 - Georgian 900 E 7th St, Bloomington, IN 47405

11:30am

ESIP Testbed Brainstorm
The Semantic Technology Committee is eager to work with ESIP community members to prototype semantic solutions to real life information technology issues. We will be identifying use cases and requirements that could benefit from semantic solutions, and will be looking for partners with who are interested in investigating the use of existing semantic applications, or prototyping new applications that address those use cases and requirements. We will seek to form partnerships between semantic technology developers and ESIP community members looking for solutions, and will work together to brainstorm possible testbed proposal ideas.

Speakers | Moderators
avatar for Beth Huffer

Beth Huffer

Lingua Logica


Friday July 28, 2017 11:30am - 1:00pm
4 - Maple 900 E 7th St, Bloomington, IN 47405

11:30am

Data Stewardship Committee Business Meeting
The Data Stewardship Committee has been active over the last year in a number of different areas, including Provenance and Context Content Standard, Data Management Training, Information Quality, and Identifiers/Citations. This session will discuss progress on committee activities, and provide an opportunity for making plans and determining roadmap for the next 6 months toward the committee’s strategic objectives:
http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2017

Speakers | Moderators
MM

Matthew Mayernik

Project Scientist / Research Data Services Specialist, National Center for Atmospheric Research (NCAR)


Friday July 28, 2017 11:30am - 1:00pm
5 - Oak 900 E 7th St, Bloomington, IN 47405

11:30am

Council for Data Facilities
Speakers | Moderators
TA

Tim Ahern

Director of Data Services, IRIS
I manage a seismological data center and a broader international data system in seismology and other time series data in general.. IRIS (and seismology in general) is an exemplar in advocating for free and open access to data. I am very interested in methods of simplifying dat... Read More →


Friday July 28, 2017 11:30am - 1:00pm
Persimmon

11:30am

Strengthening Ties Between the GCMD Keywords and Their User Communities
The goal of this workshop is to strengthen the ties between the GCMD keywords and the user community by determine the user’s needs and prioritizing future keyword releases and features based on those needs. The workshop will include a short presentation on the GCMD keywords, how other communities are using the GCMD keywords, and conclude with a open discussion of the GCMD keywords through an audience interaction activity using Slido!

I. Introduction To GCMD Keywords (20 Minutes)
  • Overview of GCMD Keywords
  • Open Keyword Community
  • Rationale for Keyword Refinement

II. Communities Using GCMD Keywords (30 Minutes)   
  • Use of GCMD Keywords To Augment Search Engine User Queries” (Lewis Mcgibbney, NASA/JPL)
  • Using GCMD Keywords for Data Discovery at the GES DISC” (Carlee Loeser, NASA/GES DISC)
  • More Rules, Less Committees: A Way Forward in Keyword Land” (Ted Habermann, HDF Group)

 

III. Open Discussion of GCMD Keywords (40 Minutes)
  • Audience Interaction Activity Using Slido!

Speakers | Moderators
avatar for Tyler Stevens

Tyler Stevens

Science Coordinator, GCMD



Friday July 28, 2017 11:30am - 1:00pm
6 - Walnut 900 E 7th St, Bloomington, IN 47405

1:00pm

Lunch
Friday July 28, 2017 1:00pm - 2:00pm
Alumni Hall