Type: | Package |
Title: | Teaching Data for Statistics and Data Science |
Version: | 0.5.3 |
Description: | Provides data sets for teaching statistics and data science courses. It includes a sample of data from John Edmund Kerrich's famous coinflip experiment. These are data that I used for statistics. The package also contains an R Markdown template with the required formatting for assignments in my former courses. |
License: | GPL-3 |
URL: | https://chris-prener.github.io/testDriveR/, https://github.com/chris-prener/testDriveR |
BugReports: | https://github.com/chris-prener/testDriveR/issues |
Encoding: | UTF-8 |
LazyData: | true |
RoxygenNote: | 7.3.2 |
Suggests: | ggplot2, knitr, rmarkdown, testthat |
NeedsCompilation: | no |
Packaged: | 2025-02-02 19:08:56 UTC; chris |
Author: | Christopher Prener
|
Maintainer: | Christopher Prener <chris.prener@gmail.com> |
Repository: | CRAN |
Date/Publication: | 2025-02-02 19:20:02 UTC |
Model Year 2017 Vehicles
Description
A data set containing model year 2017 vehicles for sale in the United States.
Usage
data(auto17)
Format
A data frame with 1216 rows and 21 variables:
- id
DOT vehicle ID number
- mfr
vehicle manufacturer
- mfrDivision
vehicle brand
- carLine
vehicle name
- carClass
vehicle type, numeric
- carClassStr
vehicle type, string
- cityFE
fuel economy, city
- hwyFE
fuel economy, highway
- combFE
fuel economy, combined
- guzzlerStr
poor fuel economy
- fuelStr
fuel, abbrev.
- fuelStr2
fuel, full
- fuelCost
estimated fuel cost
- displ
engine displacement
- transStr
transmission, full
- transStr2
transmission, abbrev.
- gears
number of gears
- cyl
number of cylinders
- airAsp
air aspiration method
- driveStr
vehicle drive type, abbrev.
- driveStr2
vehicle drive type, full
Source
https://www.fueleconomy.gov/feg/download.shtml
Examples
str(auto17)
head(auto17)
UNICEF Childhood Mortality Data
Description
A data set containing time series data by country for estimated under-5, infant, and neonatal mortality rates.
Usage
data(childMortality)
Format
A data frame with 28982 rows and 6 variables:
- countryISO
two-letter country code
- countryName
full name of country
- continent
name of continent
- category
type of mortality rate -
infant_MR
,child_MR
, orunder5_MR
- year
year of estimate
- estimate
estimated mortality rate
Source
https://childmortality.org
Examples
str(childMortality)
2014 General Social Survey
Description
A data set containing data on work, salary, and education from the 2014 General Social Survey. Missing data are explicitly identified with NAs and all data are represented as factors when appropriate.
Usage
data(gss14)
Format
A data frame with 2538 rows and 19 variables:
- YEAR
GSS year for this respondent
- INCOME06
Total family income (2006 version)
- INCOM16
Rs family income when 16 yrs old
- REG16
Region of residence, age 16
- RACE
Race of respondent
- SEX
Respondents sex
- SPDEG
Spouses highest degree
- MADEG
Mothers highest degree
- PADEG
Fathers highest degree
- DEGREE
Rs highest degree
- CHILDS
Number of children
- SPWRKSLF
Spouse self-emp. or works for somebody
- SPHRS1
Number of hrs spouse worked last week
- MARITAL
Marital status
- WRKSLF
R self-emp or works for somebody
- HRS1
Number of hours worked last week
- WRKSTAT
Labor force status
- ID_
Respondent id number
- BALLOT
Ballot used for interview
Source
https://gssdataexplorer.norc.org
Examples
str(gss14)
head(gss14)
2014 General Social Survey (Simplified)
Description
A data set containing data on work, salary, and education from the 2014 General Social Survey. Missing data are not explicitly identified with NAs and all data are represented numerically instead of as factors when appropriate.
Usage
data(gss14_simple)
Format
A data frame with 2538 rows and 19 variables:
- YEAR
GSS year for this respondent
- INCOME06
Total family income (2006 version)
- INCOM16
Rs family income when 16 yrs old
- REG16
Region of residence, age 16
- RACE
Race of respondent
- SEX
Respondents sex
- SPDEG
Spouses highest degree
- MADEG
Mothers highest degree
- PADEG
Fathers highest degree
- DEGREE
Rs highest degree
- CHILDS
Number of children
- SPWRKSLF
Spouse self-emp. or works for somebody
- SPHRS1
Number of hrs spouse worked last week
- MARITAL
Marital status
- WRKSLF
R self-emp or works for somebody
- HRS1
Number of hours worked last week
- WRKSTAT
Labor force status
- ID_
Respondent id number
- BALLOT
Ballot used for interview
Source
https://gssdataexplorer.norc.org
Examples
str(gss14_simple)
head(gss14_simple)
Kerrich Coin Toss Trial Outcomes
Description
A data set containing 2,000 trials of coin flips from statistician John Edmund Kerrich's 1940s experiments while imprisoned by the Nazis during World War Two.
Usage
data(kerrich)
Format
A data frame with 1216 rows and 21 variables:
- id
trial
- outcome
outcome of each trial; TRUE = heads, FALSE = tails
- average
cumulative mean of outcomes
Source
https://stats.stackexchange.com/questions/76663/john-kerrich-coin-flip-data/77044#77044
https://books.google.com/books/about/An_experimental_introduction_to_the_theo.html?id=JBTvAAAAMAAJ&hl=en
References
https://en.wikipedia.org/wiki/John_Edmund_Kerrich
Examples
str(kerrich)
if (require("ggplot2")) {
ggplot(data = kerrich) +
geom_hline(mapping = aes(yintercept = .5, color = "p(heads)")) +
geom_line(mapping = aes(x = id, y = average)) +
ylim(0,1)
}