Starting from:

$35

Specification 3 - Using Multiple Classes - Baby Names

Specification 3 - Using Multiple Classes - Baby Names

General Assignment Requirements
The purposes of this assignment are

to practice using Java ArrayLists
to implement a program that uses multiple classes. (object based programming, encapsulation)
to practice processing input
to practice implementing individual classes based on a given specification
Files:

 
File
Responsibility
Source Code
NameSurfer.java
Provided by me and you
Source Code
NameRecord.java
Provided by you.
Source Code
Names.java
Provided by me and you.
data files
names.txt (The primary data file.) names2.txt (Another data file with a different starting decade and a different number of ranks per name.)
Provided by me.
sample run log
nameSurferLog.txt Your output shall match this example.
Provided by me.
Submission
a3.zip with NameSurfer.java, NameRecord.java, and Names.java
Provided by you
Description: This assignment is based on an assignment created by Nick Parlante and Stuart Reges' version of that assignment..

For this assignment you may not share your tests, but you can share your results when using different data files such as names2.txt.

Complete a program that allows a user to query a data base of the 1000 most popular baby names in the United States per decade for a given number of decades under the constraints of the General Assignment Requirements and as described below.

As always, you may add helper methods and should do so to provide structure to the program and reduce redundancy.

One additional constraint: You must use the ArrayList class in the solution as discussed below. You are encouraged to use methods from the ArrayList class and String class to make life easier when possible. The ArrayList class has some helpful methods and one of the goals of this assignment is to learn those methods.

Your program processes a file with data obtained from the Social Security Administration.  SSA has a web site showing the distribution of names chosen for children over the last 100 years in the US(www.ssa.gov/OACT/babynames/).

The data file your are given represents the 1000 most popular male and female names for kids born in the US going back to 1900.

The first two lines of the file are the base decade and the number of decades. The base decade indicates the decade for the first rank. The second line is an integer than indicates the  number of decades each name is ranked. This value shall be greater than or equal to 2. The given file starts

        1900
        11

The rest of the data file are names and there ranks. There is no indication in the data I give you whether a name is a female or male name. On each line there is  a name, followed by the rank of that name in the decades 1900, 1910, 1920, ..., 2000 (11 numbers), for the example file. Your program must handle different starting decades and number of ranks correctly.

If a line does not have the correct number of ranks based on the the integer at the top of the file, your program shall ignore that line. (In other words do not make a NameRecord from a line in the data file if the line does not contain the correct number of ranks.)

A rank of 1 indicates the name was the most popular name that year, while a rank of 997 was not very popular. A 0 in the data file means the name did not appear in the top 1000 that decade.  The lines in the file are in alphabetical order based on names.

...
Sam 58 69 99 131 168 236 278 380 467 408 466
Samantha 0 0 0 0 0 0 272 107 26 5 7
Samara 0 0 0 0 0 0 0 0 0 0 886
Samir 0 0 0 0 0 0 0 0 920 0 798
Sammie 537 545 351 325 333 396 565 772 930 0 0
Sammy 0 887 544 299 202 262 321 395 575 639 755
Samson 0 0 0 0 0 0 0 0 0 0 915
Samuel 31 41 46 60 61 71 83 61 52 35 28
Sandi 0 0 0 0 704 864 621 695 0 0 0
Sandra 0 942 606 50 6 12 11 39 94 168 257
...

Note,  a 0 in the data file means the name was NOT RANKED  in the top 1000 during the corresponding decade. It has some unknown rank greater than 1000. When you store a 0 from the data file in your NameRecord objects you may use something other than 0 if you think it will make your algorithms easier to implement. (Recall altering the way data is stored to fit our needs is part of the magic of encapsulation.)

Also note the data does not indicate if a name was a girl's name or a boy's name. If a name appeared on the list of boys and girls names for a given decade the rank closets to 1 was used.

We see that “Sam” was #58 in 1900 and is slowly moving down. “Samantha” popped on the scene in 1960 and is moving up strong to #7. “Samir” barely appears in 1980, but by 2000 is up to #798. The database is for children born in the US, so ethnic trends show up when immigrants have kids.

You are given one one partial class, NameSurfer.java. This is the main driver class. When this class's main method is called it opens a window to pick the file with the names in it. The file is called names.txt. After creating the database of names encapsulated in a class called Names.java the program displays a menu and allows the user to make various queries of the database. The provided version of NameSurfer.java shows five of the options. You shall add three more.

Important. Do not hard code or assume every file will start with 1900 and / or have 11 ranks per name. Your program must read those values from the file and work correctly. I recommend you create your own results with names2.txt and post the results to Piazza to compare with your classmates.


Suggested steps for implementing the program.

0. These suggested steps describe implementing the requirements. You may add more helper methods  if you want. You may store data in a different form than the input file. For example, the files uses a 0 to indicate the name had a rank greater than 1000 for a given decade. You can store a value other than 0 if you want. (The value for "not ranked in a given decade" is on the other side of the wall of abstraction.)

1. Implement and test a class called NameRecord. Each NameRecord object stores the data for an individual name, including the name itself (a String), the base decade (decade of the first rank), and the rank of the name for each decade. The ranks for each decade must be stored in an ArrayList of Integers. (this is to give you practice using ArrayList.) The class must have the following properties:

A constructor with appropriate parameters to create a new NameRecord. 
 
a method to get the name for this NameRecord.
 
a method that returns the base decade (decade of the first rank) for this NameRecord.. 
 
a method that returns this NameRecords rank for a given decade. You can use the convention that 0 is the start decade (1900 in the example file)), 1is the next decade (1910 in the example file), and so forth. In other words the parameter will be between 0 to (max number of ranks - 1) for given data set inclusive. In other words [0, num decades) or [0, num decades - 1].
 
a method that returns an int for this NameRecord's best decade. In other words it returns the decade this name was most popular, using the most recent decade in the event of a tie. Return the actual decade based on the initial decade for this NameRecord's ranks. Looking at the data above, Samir's was most popular in 2000  while Sandra's best decade was 1940 . David's best rank was number 2 and it occurred in 1950, 1960, and 1980. The method shall return, the most recent decade, 1980.
 
a method that returns the number of decades this name has been ranked in the top 1000.
 
a method that returns true if this name has been ranked in the top 1000 in every decade
 
a method that returns true if this name has been ranked in the top 1000 in only one decade
 
a method that returns true if this name has been getting more popular every decade in the time period covered. This will be true if every decades rank is better (closer to 1) than the previous decade. The rank must improve every decade, it cannot be equal to the previous decade. [the integers representing ranks must be monotonically decreasing] So for example if the ranks were 0 950 900 875 850 800 750 700 650 600 500 400 are always improving. The ranks 0 0 0 800 850 800 750 700 650 600 500 400 are not always improving. Recall a 0 means the name did not appear in the top 1000 names for that decade. (When the rank is 0 indicating a name wasn't in the top 1000 in two consecutive decades we can't really decide one way or another if it improved from one decade to the next. So we assume it was not getting more popular.)
 
a method that returns true if this name has been getting less popular [the integers representing ranks must be monotonically increasing] every decade in the time period covered. This will be true if every decades rank is worse than the previous decade. The rank must get worse, it cannot be equal to the previous decade. So for example the ranks 100 150 200 202 250 300 350 400 450 460 0 are always getting worse. The ranks  100 150 200 202 250 300 350 400 0 0 0 are not always getting worse. 
 
override the toString from Object. The format of the returned String must be:
<NAME
<BASE DECADE: <RANK IN BASE DECADE
<BASE DECADE+ 10: <RANK IN BASE DECADE + 10
...
<LAST DECADE: <RANK IN LAST DECADE

Each line in the returned String has a newline at the end. If a name is not ranked in a given decade the Rank shall be a zero.
After completing all these methods you shall thoroughly test the NameRecord class using individual lines from the names.txt file or with your own data. Include your testing code in a method in your NameSurfer class even though it will not be called when the program is run. Part of the assignment grade will be based on the tests you write for the NameRecord class. For this assignment you may not share tests.

2. Implement and test the Names class. This class stores all of the NameRecord objects in an ArrayList. (private ArrayList<NameRecord names) This class must have the following methods. All methods that return an ArrayList of NameRecords or an  ArrayList of  Strings must be in sorted ascending order based on the names. Do not change the method headers provided in the Names.java file.

A constructor that takes in a Scanner object. The Scanner object will already be hooked up to the names.txt file. The constructor must read in the base decade and number of ranks per name and then use the Scanner to go through all the names in the file and create a NameRecord object for each one. This can be done with the following code:

String line
while( fileScanner.hasNextLine() ){
    line = fileScanner.nextLine();
    /* create a NameRecord object based on line and add it to the ArrayList of NameRecord objects */
}

How do you parse the String into the name and the years? Use the split method from the String class or a Scanner object. If you use the split method from the String class the statement would be:

String[] parsedData = data.split("\\s+"); //data is the input String

This parses the String into its individual components using whitespace as delimiters. The name itself will be the first element of the resulting array and the ranks will be in the remaining elements, although they will be Strings. You must convert them to ints using the Integer.parseInt method.. 

The other approach for parsing the input is to use a Scanner object.

Scanner lineScanner = new Scanner(data);  //data is the input String

Once the Scanner object is created you can use the next method to pull out the name and then the nextInt method to pull out the ranks. See the Java documentation for more details on the split method from the String class and Scanner objects. 

Recall if a line does not have the correct number of ranks you do not add a NameRecord for that line to the Names object.
 
A method that returns the NameRecord whose name is equal to a given String ignoring case. If there is no NameRecord that completely matches the given String return null. 
 
A method that returns an ArrayList of NameRecord objects that contain a given substring, ignoring case.  The names must be in sorted order based on name. 
public ArrayList<NameRecord getMatches(String partialName)
 
A method that returns an ArrayList of Strings of names that have been ranked in the top 1000 or better in every decade. The Strings  must be in sorted order based on name. 
public ArrayList<String rankedEveryDecade()
 
A method that returns an ArrayList of Strings of names that have been ranked in the top 1000 or better in exactly one decade. The Strings  must be in sorted order based on name. 
public ArrayList<String rankedOnlyOneDecade()
 
A method that returns an ArrayList of Strings of names that have been getting more popular every decade. The Strings  must be in sorted order based on name. 
public ArrayList<String alwaysMorePopular()
 
A method that returns an ArrayList of Strings of names that have been getting less popular every decade. The Strings  must be in sorted order based on name. 
public ArrayList<String alwaysLessPopular()
 
implement a method of your own that checks for names that meet some criteria of your own design. You must thoroughly document what your criteria is in the method comments. Part of your grade will be based on how original and creative this method is. Do not share this method with others or on the class discussion group. I don't consider finding names that are palindromes very interesting.
3. Complete the methods in the NameSurfer class and add methods for the missing  menu options.

The menu choices must be:

1 to search for names.
2 to display data for one name.
3 to display all names that appear in only one decade.
4 to display all names that appear in all decades.
5 to display all names that are more popular in every decade.
6 to display all names that are less popular in every decade.
7 to perform the method of your own design from your Names class
8 to quit

For expected program behavior review the sample run log. Your output shall match that of the sample run log for the various operations, except that you will not trim the output for operations that results in a large number of names and your output shall include the description of the search you developed.

4. Neat searches. This is an interesting application because when your program is finished you can investigate various trends in naming children. In a comment at the top of the file discuss one interesting trend you found and back it up with data / results. Here are some examples. (You may not use these as your interesting trend. Do not share your interesting trend on the discussion group.)

Why do you think Rock popular in the 1950s and Trinity in 2000?
Type in your grandparents names. Names like Mildred, Clem, Earl, and Clarence (my grandparents' names) sound old fashioned. But wait long enough and some names stage a come back. Emma. Hannah. Isabelle. Anna.
Try historical names like Sigmund, Adolf, and Ike. What about variations such as Adolph?
Why are some letters listed as names. A? M? J? (In a more recent version of the database single letter names are not included.)
Nick got the idea for this assignment from an article by Peggy Orenstein of the New York Times, Where have all the Lisas Gone?

Recently there was an article on predicting a someone's age based on their  name.


Submission: Fill in the header for NameSurfer.java and copy it into NameRecord.java and Names.java. Replace <NAME with your name. Note, you are stating, on your honor, that you did the assignment on your own.

Create a zip file name a3.zip [case sensitive! Do not name the file A3.zip!] with your NameSurfer.java, Names.java, and NameRecord.java  files. The zip file must not contain any directory structure, just the three  required files.

See this page for instructions on how to create a zip via Eclipse.

Turn in a3.zip via your Canvas account to programming assignment 3.

Ensure you files are named NameSurfer.java, Names.java, and NameRecord.java. Failure to do so will result in points off.
Ensure NameSurfer.java, Names.java, and NameRecord.java. are part of the default package. Do not add a package statement to the either file.
Ensure your zip has no internal directory structure. When the file is unzipped no directories (folders) are created. (Note, the built in unzip feature of some versions of Windows and Apple OS X "help" by adding a directory when you unzip with the same name as the file. The unzip we use on the CS Linux system does not do this. Neither do unzip programs such as 7zip.)
Ensure you submit the assignment under Programming Assignment 3 in Canvas.
If you resubmit your file you must first delete the previous versions, otherwise Canvas renames the file.