Project Goal: You are given a CSV file “220.csv”, a filtered email archive containing “Subject”, names in columns “From”, “To”, “CC”, and “BCC”; and “Importance”. You are asked to write a Python...









Project Goal:
You are given a CSV file “220.csv”, a filtered email archive containing “Subject”, names in columns “From”, “To”, “CC”, and “BCC”; and “Importance”. You are asked to write a Python program to compute statistics that will help identify the roles each involved person (including organizations) plays in the email conversations. In particular, for each person (or organization) that appears in the file, you need to find out how many conversations that person was involved in, how many times that person appeared in the “From”, “To”, “CC”, and “BCC” columns, respectively. And whether that person ever appeared in a conversation that was marked as highly important.




Data Description:
A conversation is a series of emails with the same subject. Note that prefixes of an email subject, such as “RE:” and “FW:”, are not considered as part of the subject, i.e., they must be peeled off when extracting the subject strings.



Names of people
appears
in format: Last, First Middle-Initial (Location), e.g.,


Khammivong, Somsanouk N (HOU)



Note that the middle initial may be omitted. Also note that a name can be an organization, such as



DRCS_Support



Multiple names in the same column are separated by “;”.



Each email is marked as “Normal” or “High” in importance.



You are encouraged to open the CSV file in Notepad to examine the format of the data.




Output:
The output of the program should be a CSV file whose header line contains column names “Name”, “Total”, “Conversations”, “From”, “To”, “CC”, “BCC”, and “Importance”. The other rows contain data specified as follows:




  • Column “Name”: the name of the person or organization


  • Column “Total: the total number of times the person appears in all the emails.


  • Column “Conversations”: the number of conversations the person is involved (i.e., how many times that person appears in at least one of the “From”, “To”, “CC”, and “BCC” columns of at least one email in the conversation).


  • Column “From”: the number of times the person appears in the “From” column.


  • Column “To”: the number of times the person appears in the “To” column.


  • Column “CC”: the number of times the person appears in the “CC” column.


  • Column “BCC”: the number of times the person appears in the “BCC” column.


  • Column “Importance”: If the person appears in at least one email that is marked “High” importance, the value is “High”. Otherwise, the value is “Normal”.

Dec 04, 2019STAT 5301
SOLUTION.PDF

Get Answer To This Question

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here