ECA template ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 1 of 6 ECA – July Semester 2019 ANL251 End-of-Course Assessment – July Semester 2019 XXXXXXXXXXPython...

1 answer below »
I need help for python programming assignment. I've uploaded the file. All the details can be found in the uploaded file.



ECA template ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 1 of 6 ECA – July Semester 2019 ANL251 End-of-Course Assessment – July Semester 2019 Python Programming ___________________________________________________________________________ INSTRUCTIONS TO STUDENTS: 1. This End-of-Course Assessment paper comprises SIX (6) pages (including the cover page). 2. You are to include the following particulars in your submission: Course Code, Title of the ECA, SUSS PI No., Your Name, and Submission Date. 3. Late submission will be subjected to the marks deduction scheme. Please refer to the Student Handbook for details. IMPORTANT NOTE ECA Submission Deadline: 4 September 2019, 12 noon ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 2 of 6 ECA – July Semester 2019 ECA Submission Guidelines Please follow the submission instructions stated below: A - What Must Be Submitted You are required to submit the following TWO (2) items for marking and grading: • A report (you should submit this item first as it carries the highest weightage). • The Python notebook file & the used datasets B - Submission Deadline • The TWO (2) items of report, Python code and used datasets are to be submitted by 12 noon on the submission deadline. • You are allowed multiple submissions till the cut-off date for each of the TWO (2) items. • Late submission of any of the TWO (2) items will be subjected to mark-deduction scheme by the University. Please refer to Section 5.2 Para 2.4 of the Student Handbook. C - How the TWO (2) Items Should Be Submitted • The Report: submit online to Canvas via TurnItIn (for plagiarism detection). o please ensure that your Microsoft Word document is generated by Microsoft Word 2007 or higher. o the report must be saved in .docx format. • The Python code and used datasets: o write all your code in one file (.ipynb file) o compress the .ipynb file and used datasets as a .zip file o submit the .zip file to Canvas Avoid using a public WiFi connection for submitting large video files. If you are using public wireless (WiFi) connection (e.g. SG Wireless at public areas), you might encounter a break in the connection when sending large files. Please verify your submissions after you have submitted the above TWO (2) items. D – Please be Aware of the Following: Submission in hardcopy or any other means not given in the above guidelines will not be accepted. You do not need to submit any other forms or cover sheets (e.g. form ET3) with your ECA. ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 3 of 6 ECA – July Semester 2019 You are reminded that electronic transmission is not immediate. The network traffic may be particularly heavy on the date of submission deadline and connections to the system cannot be guaranteed. Hence, you are advised to submit your work early. Canvas will allow you to submit your work late but your work will be subjected to the mark-deduction scheme. You should therefore not jeopardise your course result by submitting your ECA at the last minute. It is your responsibility to check and ensure that your files are successfully submitted to Canvas. E - Plagiarism and Collusion Plagiarism and collusion are forms of cheating and are not acceptable in any form in a student’s work, including this ECA. Plagiarism and collusion are taking work done by others or work done together with others respectively and passing it off as your own. You can avoid plagiarism by giving appropriate references when you use other people’s ideas, words or pictures (including diagrams). Refer to the APA Manual if you need reminding about quoting and referencing. You can avoid collusion by ensuring that your submission is based on your own individual effort. The electronic submission of your ECA will be screened by plagiarism detection software. For more information about plagiarism and collusion, you should refer to the Student Handbook (Section 5.2.1.3). You are reminded that SUSS takes a tough stance against plagiarism or collusion. Serious cases will normally result in the student being referred to SUSS’s Student Disciplinary Group. For other cases, significant mark penalties or expulsion from the course will be imposed. Find a news article (that is of interest to you) from any trusted sources published in the last month. Formulate a research question in order to support, object to, or expand on the claim(s) in the selected news article. Find two (2) or more publicly accessible datasets on the web which can be used to answer your research question. You may need to go through the following tasks (a)-(d) multiple times in order to arrive at a meaningful research question and findings. (a) Identify the key claim(s) in the selected news article. Identify two (2) or more datasets that are publicly accessible. Analyse the dataset and answer the following questions. What kind of information is present in the datasets? How is the data organised and what common features can be used to relate the datasets? Are there data quality issues in the datasets (such as erroneous data, missing data, etc.)? Do you need to prepare (such as clean, transform, or manipulate) the raw data for analysis? (b) Describe the measures that you need to calculate in order to answer your research question. Refer to Appendix 1 for an example. (c) Apply your Python programming skills to generate summary statistics and graphical plots (or other form of visualisations) to address the stated research question. Explain your findings and observations. ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 4 of 6 ECA – July Semester 2019 (d) Identify some possible limitations of your findings. For example, are they limited to a certain city or country? Justify your assumptions about the data, if any. Present your work for tasks (a)-(d) in your report using the template provided in Appendix 2. Provide screenshots of the relevant Python codes for each task and its output(s) where appropriate. Keep your report concise and coherent as a self-contained entity. The evaluation criteria also include logical flow of your explanation, variety of the visualisations employed and appropriate summary statistic used. More marks will be awarded to answers with in-depth analyses and practical recommendations. Limit your report file size to a maximum of 4M Bytes. For a breakdown of the marks, please refer to Appendix 2. (80 marks) Write the codes you used for implementing tasks (a)-(d) in an .ipynb file. The program should have sufficient comments to describe the corresponding steps and analyse the logical flow for each task. (20 marks) Up to 25 marks of penalties will be imposed for inappropriate or poor paraphrasing. For serious cases, they will be investigated by the examination department. More information on effective paraphrasing strategies can be found on https://academicguides.waldenu.edu/writingcenter/evidence/paraphrase/effective. ----- END OF ECA PAPER ----- https://academicguides.waldenu.edu/writingcenter/evidence/paraphrase/effective ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 5 of 6 ECA – July Semester 2019 Appendix 1 Proposing Research Questions Example: Do university towns have their mean housing prices less affected by recessions? • The following datasets could be used: • From the Zillow research data site (http://www.zillow.com/research/data/) there is housing data for the United States. In particular, the datafile for all homes at a city level (http://files.zillowstatic.com/research/public/City/City_Zhvi_AllHomes.csv), has median home sale prices at a fine-grained level. • From the Wikipedia page on college towns is a list of university towns in the United States (https://en.wikipedia.org/wiki/List_of_college_towns#College_towns_in_the_Unit ed_States). • From Bureau of Economic Analysis, US Department of Commerce, the GDP over time (http://www.bea.gov/national/index.htm#gdp) of the United States in current dollars (use the chained value in 2009 dollars), in quarterly intervals. • You may, for example, compare the ratio of the mean price of houses in university towns the quarter before the recession starts compared to the recession bottom. A recession is defined as starting with two consecutive quarters of GDP decline, and ending with two consecutive quarters of GDP growth. A recession bottom is the quarter within a recession which had the lowest GDP. ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 6 of 6 ECA – July Semester 2019 Appendix 2 Report Template & Marks Breakdown Title of the ECA project your name, SUSS PI No. and submission date Abstract – Briefly describe the dataset, research question, methods, and findings. (maximum 150 words) (10 marks) 1. Datasets Document your work for Task (a) here. Provide the web links to retrieve the used datasets. Provide screenshots of the relevant Python code and its output. (15 marks) 2. Research Question Provide the web link to retrieve the news article of your interest. Briefly describe the key claims and supporting reasons in the article. Explain how your research question supports, objects to or expands the claims in your selected news article. Document your work for Task (b) here. (10 marks) 3. Analysis Document your work for Tasks (c) here. Provide screenshots of the relevant Python code and its output. (30 marks) 4. Conclusions Document your work for Task (d) here. (10 marks) Reference (5 marks) Appendix (optional) 1
Answered Same DayAug 10, 2021

Answer To: ECA template ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 1 of 6 ECA...

Prasun Kumar answered on Sep 04 2021
135 Votes
Does Singapore’s GDP affect its taxi ridership?
Chay Whye Hoe, SUSS PI No. and September 4, 2019
Abstract - The hypothesis of this paper is that with a increasing economy, people tend to avoid travelling by taxis. To prove/disapprove this reasoning,
datasets provided by 2 service providers have been explored (1) from Dept. of Statistics, Government of Singapore (GDP data) and from Land Transport Authority, Government of Singapore (transportation data - taxi and public transport). Using Python libraries such as Tabula, Numpy, Matplotlib and Pandas, data scraping, cleaning and analysis has been done. Correlation analysis of various parameters show that there is a significant correlation between these variables.
1. Datasets
To support the hypothesis, 3 datasets from 2 sources have been used in the current study. The Land Transport Authority (LTA), under the Ministry of Transport, provides data on taxi ridership (LTA, Publications & Research, 2019) from different sources (such as metered taxis, ride-hail operators etc.) on a monthly basis. The dataset is provided in Adobe Acrobat (pdf) file format (taxi_info_*.pdf). It contains information such as average daily number of trips (for 1 shift and 2 shift taxis), number of monthly rides for different taxi fleets (such as comfort, citycab, trans-cab etc.) and information on taxi drivers’ vocational license. Figure 1 shows data scraping from pdf files using the Python module ‘Tabula’ into Pandas DataFrame.
Figure 1: Data scraping from a pdf file
LTA also provides monthly (which can be aggregated to quarterly and yearly) statistics on public transport (PT) ridership (LTA, PT Ridership, 2018), from 3 sources (Bus, Mass Rapid Transit (MRT) and Light Rail Transit (LRT)) through its alliance with Tableau. These were not available for download (Figure 2), hence, the data was aggregated at quarterly basis and scraped manually (public transport data.txt).
Figure 2: LTA PT Ridership Website Screenshot
The 3rd data source is the Department of Statistics (DOS), Government of Singapore which provides detailed quarterly Gross Domestic Product (GDP) data, bifurcated by industry and using different estimation strategies (Government of Singapore, DOS, 2019). This data is available as Microsoft Excel file (xlsx) format (gdp data.xlsx). For this analysis the “GDP at Current Prices, By Industry (SSIC 2015), Quarterly” has been used. Figure 3 shows reading the data in Python using Pandas.
Figure 3: Reading GDP data in excel file using...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here