Programming Homework Help

California University Big data analytics with R exercise Questions on a RMD File

 

  1. What does it mean to “calculate a statistic of interest for each level of a factor”?
  2. What does the read.table.ffdf() function do to enable R to work with large data sets? Describe the resulting data structure
  3. Compare read.table() with read.table.ffdf() in terms of speed, size of output, usage of ram
  4. Errata:
      • be sure to set your temp dir inside your working director and identify it for R with the options() function.
      • if your R can’t find any functions that end in .ffdf, prepend those functions with ff::: like so ff:::dimnames.ffdf()
  5. Calculate the mean departure delay for each city of origin.

Since study pool does not allow to add csv the files to work on this and code are found here: https://github.com/PacktPublishing/Big-Data-Analyt…

Select the airline_id CSV and flights_sep_oct15.zip from the link above, I cannot just copy-paste the code as given in the text since it will show 100% copied. https://github.com/PacktPublishing/Big-Data-Analyt…

Work your magic and answer all the questions above.

Please note a previous tutor without asking anything cancelled this request stating “Student did not send necessary information for me to complete the answer.” While I had written the same thing with all the necessary. So do not request more days other than the requirement. This is a simple exercise and let me know if you have any questions. Thank you!