Hone your R skills by doing problems.
Please attempt all questions.
The assignment is due Friday 10 November and you are encouraged to work in group and to hand in a single copy for the group.
Make reasonable column names and convert the columns into the correct data types. Name the resulting data frame ‘rail’. Show me part of the results with head(rail).
Add a column to the data frame that records the relative frequency of station per country.
Create a plot of total number of station per year. Do you think any year is an outlier? Remove the outlier(s) and recreate the plot.
president <- quanteda::data_corpus_inaugural
Report your results. Do you notice any pattern?
Are they stopwords? Punctuation?
Which one? What is the context?