Projects‎ > ‎

Project 4

Implement the script you planned out two weeks ago.
  • Open two data files
  • Reads the files and
    • Merges the data into a single output file
    • Removes duplicate data
    • The output file needs to be sorted by the data key
Some notes:
  • We'll assume that every line in the input files is one data record
  • The two files can have the same format
  • To detect duplicates and to sort the data, each record has to be identified somehow:
    • You can compare the entire data record, or
    • You can extract a "key" value from the data, i.e. one field in the data is used to compare and sort
      • A "field" in the data can be a range of columns, or one item in a tab or comma delimited file
Create your own input files; you will submit them along with the script. Two sample files are attached - you can use these for testing.
The sample files show curriculum actions; the files are comma-delimited.  The third field is the key in this case.
Hans Kruse,
Sep 17, 2018, 5:16 AM
Hans Kruse,
Sep 17, 2018, 5:16 AM