| Type: | Package |
| Title: | Datasets for the Book Graphical Data Analysis with R |
| Version: | 0.93 |
| Date: | 2015-05-02 |
| Author: | Antony Unwin |
| Maintainer: | Antony Unwin<unwin@math.uni-augsburg.de> |
| Description: | Datasets used in the book 'Graphical Data Analysis with R' (Antony Unwin, CRC Press 2015). |
| Depends: | R (≥ 2.10) |
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
| Suggests: | ggplot2 |
| LazyData: | yes |
| NeedsCompilation: | no |
| Packaged: | 2015-05-02 09:42:33 UTC; antonyunwin |
| Repository: | CRAN |
| Date/Publication: | 2015-05-02 14:11:23 |
Top performances in the Decathlon from 1985 to 2006.
Description
The point scoring system for the Decathlon last changed in 1985. Best annual performances of 6800 points and over for a twenty-one year period after the new rules were introduced were downloaded from the excellent Estonian website Decathlon2000. Handtimed performances were not included. Names with accents have been simplified.
Usage
data(Decathlon)
Format
A data frame with 7968 observations on the following 24 variables.
Totalpointsthe total points achieved over all 10 events
DecathleteNameDecathlete's name
NationalityDecathlete's nationality
m100Time for the 100 metres (secs)
LongjumpDistance jumped (metres)
ShotputDistance putting the shot (metres)
HighjumpHeight jumped (metres)
m400Time for the 400 metres (secs)
m110hurdlesTime for the 110 metres hurdles (secs)
DiscusDistance throwing the discus (metres)
PolevaultHeight achieved (metres)
JavelinDistance throwing the javelin (metres)
m1500Time for the 1500 metres (secs)
yearEventYear of performance
P100mPoints for performance in 100 metres
PljPoints for performance in long jump
PspPoints for performance in putting the shot
PhjPoints for performance in high jump
P400mPoints for performance in 400 metres
P110hPoints for performance in 110 metres hurdles
PpvPoints for performance in pole vault
PdtPoints for performance in discus
PjtPoints for performance in javelin
P1500Points for performance in 1500 metres
Source
Examples
data(Decathlon, package="GDAdata")
summary(Decathlon[, grep("P.*", names(Decathlon))])
library(ggplot2)
ggplot(Decathlon, aes(Plj)) + geom_histogram()
ggplot(Decathlon, aes(P100m, Plj)) + geom_point()
Figures for the trade between England and the East Indies in the 18th century.
Description
The data have been estimated from the graphic in the first edition of Playfair's Commercial and Political Atlas by the website 'Me, myself, and BI'.
Usage
data(EastIndiesTrade)
Format
A data frame with 81 observations on the following 3 variables.
Yearthe data go from 1700 to 1780
ExportsExports from England to the East Indies (millions of pounds)
ImportsImports to England from the East Indies (millions of pounds)
Source
http://blog.bissantz.com/vis-a-vis
Examples
data(EastIndiesTrade, package="GDAdata")
library(ggplot2)
ggplot(EastIndiesTrade, aes(x=Year, y=Exports-Imports)) + geom_line()
Star data useful for drawing a Hertzsprung-Russell diagram.
Description
Hertzsprung-Russell diagrams plot star luminosity (brightness) against temperature (colour). The first one was drawn just over 100 years ago. The dataset is the Yale Trigonometric Parallax Dataset and this version can be found on the webpage of the Astronomy Department of Case Western Reserve University.
Usage
data(HRstars)
Format
A data frame with 6220 observations on the following 5 variables.
IDstar ID number
Vapparent V magnitude
BVobserved B-V color
Paraobserved parallax (in arcsec)
Uncertuncertainty in parallax (in milliarcsec)
Source
http://burro.astr.cwru.edu/Academics/Astr221/HW/HW5/HW5.html
Examples
data(HRstars, package="GDAdata")
with(HRstars, hist(BV))
with(HRstars, hist(V))
Data from the longjump final in the 1968 Mexico Olympics.
Description
The best longjumps by the 16 finalists in the 1968 Mexico Olympics. Each athlete jumped up to six times, though the winner of the Gold Medal, Bob Beamon, only jumped twice.
Usage
data(MexLJ)
Format
A data frame with 16 observations on the following variable.
JumpDistance jumped measured in metres
Source
http://en.wikipedia.org/wiki/Athletics_at_the_1968_Summer_Olympics_-_Men's_long_jump
Examples
data(MexLJ, package="GDAdata")
with(MexLJ, summary(Jump))
with(MexLJ, hist(Jump,breaks=seq(7.25,9,0.25)))
World Speed Skiing Competition, Verbier 21st April, 2011
Description
There were separate Speed Skiing competitions for men (79 participants) and women (12 participants).
Usage
data(SpeedSki)
Format
A data frame with 91 observations on the following 8 variables.
RankFinishing position by sex
BibStart number
FIS.CodeSkier's international skiing ID number
NameSkier's name
YearSkier's year of birth
NationSkier's nationality
SpeedSpeed achieved in km/hr
SexFemale or Male
EventSpeed Downhill, Speed Downhill Junior or Speed One
no.of.runsNo of runs
Source
http://www.fis-ski.com/de/606/612.html?sector=SS&raceid=262 (men)
http://www.fis-ski.com/de/606/612.html?sector=SS&raceid=263 (women)
Examples
data(SpeedSki, package="GDAdata")
with(SpeedSki, summary(Speed))
library(ggplot2)
ggplot(SpeedSki, aes(Speed)) + geom_histogram(binwidth=5)
Nutritional value of food.
Description
Nutritional value of different foods based on standard serving sizes.
Usage
data(foodnames)
Format
A data frame with 961 observations on the following 9 variables.
Namename of food (not unique)
Measureserving description
Fat.gramsgrams of fat in a standard serving
Food.energy.caloriescalories per serving
Carbohydrates.gramsgrams of carbohydrates per serving
Protein.gramsgrams of protein per serving
Cholesterol.mgcholesterol in mg per serving
weight.gramsweight in grams of a standard serving
Saturated.fat.gramsgrams of saturated fat per serving
Source
The data are used in A. Izenman (2008), Modern Multivariate Statistical Techniques, Springer
and are available on the accompanying website
http://astro.temple.edu/~alan/MMST/
Examples
data(foodnames, package="GDAdata")
summary(foodnames)
library(ggplot2)
ggplot(foodnames, aes(Fat.grams, Saturated.fat.grams)) + geom_point()
The Guardian University League Table 2013
Description
The Guardian newspaper in the UK publishes a ranking of British universities each year and it reported these data in May, 2012 as a guide for 2013.
Usage
data(uniranks)
Format
A data frame with 120 observations on the following 13 variables.
RankRank of the University
InstitutionUniversity name
UniGroupUniversities can be a member of one of five groups,
1994 Group,Guild HE,Million+,Russell,University Alliance, or noneHesaCodeUniversity's Higher Education Statistics Agency code
AvTeachScoreAverage Teaching Score
NSSTeachingUniversity's National Student Survey teaching score
NSSOverallUniversity's NSS overall score
SpendPerStudentUniversity expenditure per student (depends on subject)
StudentStaffRatioStudent to Staff ratio
CareerProspectsProportion of graduates in appropriate level employment or full-time study within six months of graduation
ValueAddScore”Based upon a sophisticated indexing methodology that tracks students from enrolment to graduation, qualifications upon entry are compared with the award that a student receives at the end of their studies.” (Guardian)
EntryTariffValue dependent on the average points needed to get on the university's courses
NSSFeedbackUniversity's NSS feedback score
Source
http://www.theguardian.com/news/datablog/2012/may/22/university-guide-2013-guardian-data
Examples
data(uniranks, package="GDAdata")
with(uniranks, table(UniGroup))
library(ggplot2)
ggplot(uniranks, aes(x=NSSTeaching, y=NSSFeedback)) + geom_point()
ggplot(uniranks, aes(x=UniGroup, y=SpendPerStudent)) + geom_boxplot()