Lterdatasampler
LTER data samples to teach environmental data science
Install / Use
/learn @lter/LterdatasamplerREADME
lterdatasampler <a href='https://lter.github.io/lterdatasampler/'><img src="man/figures/logo.png" id="home_logo" align="right" height="180"/></a>
The mission of the Long Term Ecological Research program (LTER) Network is to “provide the scientific community, policy makers, and society with the knowledge and predictive understanding necessary to conserve, protect, and manage the nation’s ecosystems, their biodiversity, and the services they provide.” A specific goal of the LTER is education and training - “to promote training, teaching, and learning about long-term ecological research and the Earth’s ecosystems, and to educate a new generation of scientists.”
The goal of this package is to provide a sampler to gather feedback from the community of what will be a larger package containing 28 datasets - one from each of the existing US LTER sites. Those datasets are subsets of the original data and have been updated - sometimes substantially - from the raw data. They are aimed to be useful for teaching and training in environmental data science. This content is thus not suitable for research and should only be used for teaching purposes.
We encourage you to explore existing LTER teaching and training initiatives, and the many other available LTER datasets which can be accessed via the Environmental Data Initiative. Please contact cited researchers directly to discuss using data for research purposes or in publication.
Installation
You can install the CRAN version of lterdatasampler with:
install.packages("lterdatasampler")
You can install the development version of lterdatasampler from GitHub
with:
# install.packages("remotes")
remotes::install_github("lter/lterdatasampler")
The dataset samples
Dataset samples currently included in the package are summarized below; see individual Articles for data and source details. Note: the three letter prefix for each dataset indicates the LTER site (see full list of site abbreviations).
and_vertebrates: Records for aquatic vertebrates (cutthroat trout and salamanders) in Mack Creek, Andrews Experimental Forest, Oregon (1987 - present)arc_weather: Daily meteorological (e.g. air temperature, precipitation) records from Toolik Field Station, Alaska (1988 - present)hbr_maples: Sugar maple seedlings at Hubbard Brook Experimental Forest (New Hampshire) in calcium-treated and reference watersheds in August 2003 and June 2004knz_bison: Bison masses recorded for the herd at Konza Prairie Biological Station LTERluq_streamchem: stream chemistry data for the Quebrada Sonadora (QS) location part of the Luqillo tropical forest LTER sitentl_icecover: Ice freeze and thaw dates for Madison, Wisconsin Area lakes (1853 - 2019), North Temperate Lakes LTERntl_airtemp: Daily average air temperature data for Madison, Wisconsin (1869 - 2019), North Temperate Lakes LTERnwt_pikas: Pika observations for habitat and stress analysis at Niwot Ridge LTER, Coloradopie_crab: Fiddler crab body size recorded summer 2016 in salt marshes from Florida to Massachusetts including Plum Island Ecosystem LTER, Virginia Coast LTER, and NOAA’s National Estuarine Research Reserve System
Which data sample should I use?
These data samples are selected because they have features we feel are commonly useful in introductory environmental data science and statistics courses.
In the table below, we list some introductory methods / skills, then share which data samples in this package we think are well-suited to use when teaching or learning them! It is not comprehensive - there are many different analyses & skills that these data samples would facilitate. Here we highlight a few that we think would be commonly useful
Recommended data samples for introducing selected topics
<table class="gt_table" data-quarto-disable-processing="false" data-quarto-bootstrap="false"> <thead> <tr class="gt_col_headings"> <th class="gt_col_heading gt_columns_bottom_border gt_left" rowspan="1" colspan="1" scope="col" id=""></th> <th class="gt_col_heading gt_columns_bottom_border gt_center" rowspan="1" colspan="1" scope="col" id="Data sample">Data sample</th> <th class="gt_col_heading gt_columns_bottom_border gt_left" rowspan="1" colspan="1" scope="col" id="For example you could:">For example you could:</th> </tr> </thead> <tbody class="gt_table_body"> <tr class="gt_row_group_first"><td headers="Linear relationships stub_1_1 stub_1" rowspan="3" class="gt_row gt_left gt_stub_row_group" style="vertical-align:middle">Linear relationships</td> <td headers="Linear relationships stub_1_1 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/pie_crab_vignette.html">`pie_crab`</a></td> <td headers="Linear relationships stub_1_1 data_description" class="gt_row gt_left">Model the relationship between fiddler crab size and latitude using `pie_crab` , while learning about Bergmann's Rule!</td></tr> <tr><td headers="Linear relationships full_link_2 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/ntl_icecover_vignette.html">`ntl_icecover`</a></td> <td headers="Linear relationships full_link_2 data_description" class="gt_row gt_left">Investigate the relationship between winter temperatures and ice cover duration for Wisconsin lakes using `ntl_icecover`</td></tr> <tr><td headers="Linear relationships full_link_3 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/hbr_maples_vignette.html">`hbr_maples`</a></td> <td headers="Linear relationships full_link_3 data_description" class="gt_row gt_left">Explore seedling height-mass relationships for sugar maples using `hbr_maples`</td></tr> <tr class="gt_row_group_first"><td headers="Non-linear relationships stub_1_4 stub_1" rowspan="2" class="gt_row gt_left gt_stub_row_group" style="vertical-align:middle">Non-linear relationships</td> <td headers="Non-linear relationships stub_1_4 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/knz_bison_vignette.html">`knz_bison`</a></td> <td headers="Non-linear relationships stub_1_4 data_description" class="gt_row gt_left">Model the relationship between bison age and mass for male and female bison using `knz_bison`, for example estimating parameters in the Gompertz model</td></tr> <tr><td headers="Non-linear relationships full_link_5 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/and_vertebrates_vignette.html">`and_vertebrates`</a></td> <td headers="Non-linear relationships full_link_5 data_description" class="gt_row gt_left">Model the length-mass relationships for cutthroat trout and salamanders in Mack Creek, Oregon</td></tr> <tr class="gt_row_group_first"><td headers="Time series analysis stub_1_6 stub_1" rowspan="2" class="gt_row gt_left gt_stub_row_group" style="vertical-align:middle">Time series analysis</td> <td headers="Time series analysis stub_1_6 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/arc_weather_vignette.html">`arc_weather`</a></td> <td headers="Time series analysis stub_1_6 data_description" class="gt_row gt_left">Explore seasonality, wrangling dates, or practice forecasting using daily meteorological records from Toolik Station, Alaska</td></tr> <tr><td headers="Time series analysis full_link_7 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/luq_streamchem_vignette.html">`luq_streamchem`</a></td> <td headers="Time series analysis full_link_7 data_description" class="gt_row gt_left">Investigate the impact of a hurricane on stream water chemistry</td></tr> <tr class="gt_row_group_first"><td headers="Spatial data introduction stub_1_8 stub_1" rowspan="1" class="gt_row gt_left gt_stub_row_group" style="vertical-align:middle">Spatial data introduction</td> <td headers="Spatial data introduction stub_1_8 full_link" class="gt_row gt_center"><a href = "https://lter.github.io/lterdatasampler/articles/nwt_pikas_vignette.html">`nwt_pikas`</a></td> <td headers="Spatial data introduction stub_1_8 data_description" class="gt_row gt_left">Introduce basics of spatial data (e.g. CRS, projections) and tools for working with spatial data by visualizing pika locations at Niwot Ridge in the Colorado Rockies</td></tr> <tr class="gt_row_group_first"><td headers="Comparing groupsRelated Skills
feishu-drive
345.4k|
things-mac
345.4kManage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database)
clawhub
345.4kUse the ClawHub CLI to search, install, update, and publish agent skills from clawhub.com
codebase-memory-mcp
1.1kHigh-performance code intelligence MCP server. Indexes codebases into a persistent knowledge graph — average repo in milliseconds. 66 languages, sub-ms queries, 99% fewer tokens. Single static binary, zero dependencies.
