What this means is that we are also in different places in our R journey - from beginner to advanced.
What this means is that we are also in different places in our R journey - from beginner to advanced.
Typically, we have two main types of air quality measurements.
This is what stationary monitoring network looks like. Multiple sensors recording data at real time ~ 1 min frequency. Gives information about trends at the city scale
For a neighborhood level measurement, we use mobile monitoring. This schematic shows the many instruments that go in the mobile platform. You can imagine the complexity, diff instruments, have different download methods, from the point of view of analysis - different data formats, different time stamps, etc.
We are looking at the inside of the car now. We have a laptop as a logger for some instruments. And all insruments are secured in the blue tray with inlets outside the vehicle. This is what looks like in pracise,...
All that to say that many times, things are out of our control in the field. Next,
I will talk about some of our favourite R packages and functions and what we use them for
We use all tidyverse packages for data cleaning. Ggplot for highly flexible plotting, purrr and map functions for more efficient and faster code instead of for loops, and forcats for functions. Readr, read_csv automatically parses date-time objects, that is is very helpful
lubridate
Data from different instruments often have different...
ymd_hms()
, dmy_hms()
...with_tz()
Step 1 is consistent time stamps so that we can join all data sets together. Some instruments are either in UTC time zone, or depending on the country of origin in a diff time zone.
janitor
## [1] "Elevation" "Year" ## [3] "Month" "Day" ## [5] "Season" "Julian Day" ## [7] "PM2.5-11hrs" "PM2.5 -14hrs" ## [9] "PM2.5 -dailymean" "PM2.5-10-14 hrs mean" ## [11] "AOD-Terra" "AOD-Aqua" ## [13] "AOD-Terra-Aqua mean" "NDVI" ## [15] "CWV-Terra" "CWV-Aqua" ## [17] "CWV-Terra-Aqua mean" "2m Temperatue-11hrs" ## [19] "2m Temperature -14hrs" "2m Temperature-dailymean" ## [21] "2m Temperature-10-14 hrs mean"
library(janitor)my_data <- my_data %>% clean_names()names(my_data)
## [1] "elevation" "year" ## [3] "month" "day" ## [5] "season" "julian_day" ## [7] "pm2_5_11hrs" "pm2_5_14hrs" ## [9] "pm2_5_dailymean" "pm2_5_10_14_hrs_mean" ## [11] "aod_terra" "aod_aqua" ## [13] "aod_terra_aqua_mean" "ndvi" ## [15] "cwv_terra" "cwv_aqua" ## [17] "cwv_terra_aqua_mean" "x2m_temperatue_11hrs" ## [19] "x2m_temperature_14hrs" "x2m_temperature_dailymean" ## [21] "x2m_temperature_10_14_hrs_mean"
There are several inconsistent features here. Use of dash vs space, colum names starting with numbers, capital letters vs small letters.
leaflet
demo we are looking at averages over several rides, so these are stable maps. You may recall the purpose of mobile monitoring is high resolution pollution maps combining location and pollution data
Base R
lm()
glm()
prcomp()
lme4
package
gam
packagecool plot options.
There are plenty of repeat tasks. Mapping each neighborhood 20+ times and then mapping several neighborhoods
https://github.com/meenakshi-kushwaha/mmaqshiny
Option to upload each file and automatically joins them. There are alarm and settings so we know if something is wrong. and time series plots for daily qa/qc
https://github.com/adithirgis/pollucheck
another shiny app that we have developed aims to simplify cleaning data from open access data from stationary monitors and automatically generate statistical plots.
Pitch no - 26
+
+
project description could be collection dates, locations, etc.
Code and outputs side by side
Facilitates sharing and reviewing
Repdroduce analysis easily
Different output formats possible
a <- 2
I have `r a` cats and `r a+1` dogs.
I have 2 cats and 3 dogs.
---title: "TOC"output: html_document: toc: true toc_float: true---
Individual diagnostic reports using parameters
---title: "Diagnostic Report" output: html_documentparams: year: 2021 region: Site_1 data: file.csv---
If someone had said, join R because of amazing community, i would never have believed it. I just needed technology to learn to make cool plots.
https://tinystats.github.io/teacups-giraffes-and-statistics/index.html
By Walum & Leon
"A delightful series of modules to learn statistics and R coding for students, scientists, and stats-enthusiasts"
Using projects and version control for organizing
Following tidyverse style guide for code
Commenting for "why" of code and new packages
Using relative file paths and avoid hard-coding file locations
here
instead of setwd()
Bookdown project
Tidymodels
Accessibility
Now that we have the lu
What this means is that we are also in different places in our R journey - from beginner to advanced.
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
What this means is that we are also in different places in our R journey - from beginner to advanced.
Typically, we have two main types of air quality measurements.
This is what stationary monitoring network looks like. Multiple sensors recording data at real time ~ 1 min frequency. Gives information about trends at the city scale
For a neighborhood level measurement, we use mobile monitoring. This schematic shows the many instruments that go in the mobile platform. You can imagine the complexity, diff instruments, have different download methods, from the point of view of analysis - different data formats, different time stamps, etc.
We are looking at the inside of the car now. We have a laptop as a logger for some instruments. And all insruments are secured in the blue tray with inlets outside the vehicle. This is what looks like in pracise,...
All that to say that many times, things are out of our control in the field. Next,
I will talk about some of our favourite R packages and functions and what we use them for
We use all tidyverse packages for data cleaning. Ggplot for highly flexible plotting, purrr and map functions for more efficient and faster code instead of for loops, and forcats for functions. Readr, read_csv automatically parses date-time objects, that is is very helpful
lubridate
Data from different instruments often have different...
ymd_hms()
, dmy_hms()
...with_tz()
Step 1 is consistent time stamps so that we can join all data sets together. Some instruments are either in UTC time zone, or depending on the country of origin in a diff time zone.
janitor
## [1] "Elevation" "Year" ## [3] "Month" "Day" ## [5] "Season" "Julian Day" ## [7] "PM2.5-11hrs" "PM2.5 -14hrs" ## [9] "PM2.5 -dailymean" "PM2.5-10-14 hrs mean" ## [11] "AOD-Terra" "AOD-Aqua" ## [13] "AOD-Terra-Aqua mean" "NDVI" ## [15] "CWV-Terra" "CWV-Aqua" ## [17] "CWV-Terra-Aqua mean" "2m Temperatue-11hrs" ## [19] "2m Temperature -14hrs" "2m Temperature-dailymean" ## [21] "2m Temperature-10-14 hrs mean"
library(janitor)my_data <- my_data %>% clean_names()names(my_data)
## [1] "elevation" "year" ## [3] "month" "day" ## [5] "season" "julian_day" ## [7] "pm2_5_11hrs" "pm2_5_14hrs" ## [9] "pm2_5_dailymean" "pm2_5_10_14_hrs_mean" ## [11] "aod_terra" "aod_aqua" ## [13] "aod_terra_aqua_mean" "ndvi" ## [15] "cwv_terra" "cwv_aqua" ## [17] "cwv_terra_aqua_mean" "x2m_temperatue_11hrs" ## [19] "x2m_temperature_14hrs" "x2m_temperature_dailymean" ## [21] "x2m_temperature_10_14_hrs_mean"
There are several inconsistent features here. Use of dash vs space, colum names starting with numbers, capital letters vs small letters.
leaflet
demo we are looking at averages over several rides, so these are stable maps. You may recall the purpose of mobile monitoring is high resolution pollution maps combining location and pollution data
Base R
lm()
glm()
prcomp()
lme4
package
gam
packagecool plot options.
There are plenty of repeat tasks. Mapping each neighborhood 20+ times and then mapping several neighborhoods
https://github.com/meenakshi-kushwaha/mmaqshiny
Option to upload each file and automatically joins them. There are alarm and settings so we know if something is wrong. and time series plots for daily qa/qc
https://github.com/adithirgis/pollucheck
another shiny app that we have developed aims to simplify cleaning data from open access data from stationary monitors and automatically generate statistical plots.
Pitch no - 26
+
+
project description could be collection dates, locations, etc.
Code and outputs side by side
Facilitates sharing and reviewing
Repdroduce analysis easily
Different output formats possible
a <- 2
I have `r a` cats and `r a+1` dogs.
I have 2 cats and 3 dogs.
---title: "TOC"output: html_document: toc: true toc_float: true---
Individual diagnostic reports using parameters
---title: "Diagnostic Report" output: html_documentparams: year: 2021 region: Site_1 data: file.csv---
If someone had said, join R because of amazing community, i would never have believed it. I just needed technology to learn to make cool plots.
https://tinystats.github.io/teacups-giraffes-and-statistics/index.html
By Walum & Leon
"A delightful series of modules to learn statistics and R coding for students, scientists, and stats-enthusiasts"
Using projects and version control for organizing
Following tidyverse style guide for code
Commenting for "why" of code and new packages
Using relative file paths and avoid hard-coding file locations
here
instead of setwd()
Bookdown project
Tidymodels
Accessibility
Now that we have the lu