ITECH1103 Analytic Group Assignment - Semester 2 2018: ITECH1103 - Big Data and Analytics Group Assignment – Semester 3, 2018 Worth – 30% ANALYTIC REPORT (20% - Due Week 11...

hey how much will u charge for the report


ITECH1103 Analytic Group Assignment - Semester 2 2018

ITECH1103 - Big Data and Analytics
Group Assignment – Semester 3, 2018
Worth – 30%
ANALYTIC REPORT (20% - Due Week 11 Sunday 11:55pm) and PRESENTATION (10% - Due Week 10 in Tutorial Time)

Analytic Report:
Learning Outcomes Assessed: A3, K3, K6, and S2
Purpose: The purpose of this task is to provide students with practical experience in working in teams to write a data analytics report that provides useful insights, patterns and trends in the chosen/given dataset. This activity will give students the opportunity to show innovation and creativity in applying Watson Analytics and designing useful visualization and predictive solutions for various analytics problems.

Group Presentation: Week 10 (Scheduled Laboratory)
Learning Outcomes Assessed: K4, A1, A2, V1, V2
Purpose: The purpose of the oral presentation is to provide an opportunity for students to present the results of their data analysis and to share this knowledge while practicing their verbal communication skills.

Project Details:
Consider that you are working as a Content Analyst in ABC, an online multimedia company, and your task for this analytical project is to use an analytical tool (i.e. IBM Watson Analytics) to explore, analyse and visualize the given dataset. This dataset reflects details about different videos uploaded during the period from 2006 to 2018. The original dataset was extracted from Kaggle.com and then modified and uploaded to https://data.world/iamdilan/youtube-dataset. Your primary goal is to download the modified dataset and provide different and interesting insights in light of the 20 guided questions listed below, along with advanced insights.
The dataset can be downloaded from the following link.
Dataset source: https://data.world/iamdilan/youtube-dataset

Data Dictionary:
Video_id: Unique identity of video
Trending_date: Trending date of video
Title: Name of video
Channel_title: Name of channel
Category_id: See category list below (table)
Publish_date: The date on which the video was published
Time_frame: The time at which the video was uploaded/published
Publish_day_of_week: Day of the week the video was published
Publish_country: Country in which the video was published
Tags: Tags
Views: Number of views of video
Likes: Number of likes of video
Dislikes: Number of dislikes of video
Comments_count: Number of comments on a video
Comments_disable: Whether comments are disabled or not
Ratings_disabled: Whether ratings are disabled or not
Video_error_or_removed: Whether the video has an error or has been removed

YouTube Video Category Id list:
1 - Film & Animation
2 - Autos & Vehicles
10 - Music
15 - Pets & Animals
17 - Sports
18 - Short Movies
19 - Travel & Events
20 - Gaming
21 - Videoblogging
22 - People & Blogs
23 - Comedy
24 - Entertainment
25 - News & Politics
26 - How to & Style
27 - Education
28 - Science & Technology
29 - Nonprofits & Activism
30 - Movies
31 - Anime/Animation
32 - Action/Adventure
33 - Classics
34 - Comedy
35 - Documentary
36 - Drama
37 - Family
38 - Foreign
39 - Horror
40 - Sci-Fi/Fantasy
41 - Thriller
42 - Shorts
43 - Shows
44 - Trailers

You are expected to present the data findings in visual form (i.e., charts and graphs). This is a group assignment. You will complete it with your team (max 3 members enrolled in the same laboratory). It is expected that each team member will contribute equally to the project. Each team will turn in one joint document and give a joint presentation in the Timetabled Laboratory class in Week 10.
In addition, each individual team member will write a short reflection as part of the report. You will receive feedback on the draft about presentation choices, content, analysis, and style.

The Questions
Your job is to examine the dataset and present it in a set of informative graphs and text by answering the following questions.

Guided Questions for Dataset
1. What is the total number of uploaded videos in this dataset?
2. How many different types of uploaded categories are there?
3. What is the number of countries in this dataset?
4. What is the number of (unique) channels in this dataset?
5. Which are the top three countries, according to number of channels, in this dataset?
6. What is the lowest number of channels by country?
7. How many different unique channels are there in the US?
8. Provide a list of the top 10 viewed video titles with respect to each country.
9. Provide a list of the 10 least viewed video titles with respect to each country.
10. How many years of uploaded videos are there in the data file?
11. How many uploaded videos have there been in the last month? (Select the last month of the year)
12. In which year were the most videos uploaded in GB?
13. Which hour had the most uploaded videos in this dataset? Are there any differences between countries? (time_frame)
14. What are the top 3 viewed categories in terms of number of uploaded videos?
15. What are the 3 least viewed categories in terms of number of uploaded videos?
16. Which video has the highest percentage of likes?
17. Which video has the highest percentage of dislikes?
18. Which day has the highest number of video uploads?
19. Which day has the lowest number of video uploads?
20. What is the monthly breakdown of published videos?

Task 1 - Background information
Write a description of the selected dataset and project, and its importance for the firm. Information must be appropriately referenced.
[1 Page]

Task 2 – Reporting / Dashboards
For your project, perform the relevant data analysis tasks by answering the above questions, and identify the visualizations and dashboards you need to develop for the Content Manager of the indicated firm. [2-3 Pages]

Task 3 – Advanced Insights
In addition to the guided questions, you are expected to provide at least five (5) insights into the data. These insights will be judged in terms of quality and complexity.

Task 4 – Research
Justify why these BI reporting solutions/dashboards were chosen in Task 2 (Reporting / Dashboards) and why the dataset attributes are present and laid out in the fashion you proposed (feel free to include all other relevant justifications).
Note: To ensure that you discuss this task properly, you must include visual samples of the reports you produce (i.e. the screenshots of the BI report/dashboard must be presented and explained in the written report; use ‘Snipping tool’), and also include any assumptions that you may have made about the analysis in your Task 2 (i.e. the report to the content manager of the company). [1-2 Pages]

Task 5 – Recommendations for Content Manager
The Content Manager would like to improve the multimedia operations. Based on your BI analysis and the insights gained from the dataset in light of the analysis performed in previous tasks, make some logical recommendations to the Content Manager, and justify why/how your proposal could enhance the company’s multimedia operations and could assist in achieving operational/strategic objectives, with the help of appropriate references from peer-reviewed sources. [1-2 Pages]

Task 6 – Cover letter
Write a cover letter to the Content Manager with the important data insights and recommendations to achieve operational/strategic objectives. [1 page]

Task 7 – The Reflection
Each team member is expected to write a brief reflection about this project in terms of challenges, learning and contribution.
Other Tasks – Please refer to the marking scheme at the end of the assignment for other tasks and expectations.

Report Submission:
• Hard-copy to tutors/lecturers assignment box in week 10. Double-sided printing for the hard-copy is encouraged in order to save paper.
• You will also submit a 7-8 page report (about 1500 words not counting cover page and references) of this project. At least 15 references in your report must be from peer-reviewed sources. Include any and all sources of information including any person(s) you interviewed for this project.
• Please note that all references must adhere to APA style. See http://owl.english.purdue.edu/owl/resource/560/01 and http://www.apastyle.org/ for details on how to format a report and how to cite references. Make sure you follow formal report structure with cover page, introduction, use of headings, subheadings, conclusions and reference section.
• You are reminded to read the “Plagiarism” section of the course description. Your essay should be a synthesis of ideas from a variety of sources expressed in your own words. All reports must use the APA referencing style. University Referencing/Citation Style Guide: The University has published a style guide to help students correctly reference and cite information they use in assignments (American Psychological Association (APA) citation style, http://www.ballarat.edu.au/aasp/student/learning_support/generalguide/print/ch06s04.shtml or Australian citation style).
• Reports are to be presented in hard copy in size 12 Arial Font and double spaced. Your report should include a list of references used in the essay and a bibliography of the wider reading you have done to familiarize yourself with the topic.
• A passing grade will be awarded to assignments adequately addressing all assessment criteria. Higher grades require better quality and more effort. For example, a minimum is set on the wider reading required.
A student reading vastly more than this minimum will be better prepared to discuss the issues in depth and consequently their report is likely to be of a higher quality. So before submitting, please read through the assessment criteria very carefully.
Answered Same Day | Jan 19, 2021 | ITECH1103


Sundeep answered on Jan 20 2021
Student Name:
Course ID:
Assessor Name:
Submission Date:
Task 1:
Background Information
The dataset to be analysed is a YouTube dataset covering videos uploaded over a period of 13 years (2006 – 2018). According to the assignment brief, it was originally extracted from Kaggle.com and then modified and uploaded to data.world.
The team has to study the dataset using IBM Watson Analytics. IBM Watson Analytics is an online analytics tool whose visualizations can be used to present data to managers in a meeting. Because the tool runs online, its ease of use depends on the speed and capacity of the internet connection. We develop the analytic visualizations and present the insights in a form that helps the Content Manager take decisions that favour the organization.
As a team, we have to clean the given raw data and analyse it in a form that enables us to draw insights.
Four countries are represented in the data. People in these countries upload videos and like, dislike and comment on them. YouTube is part of Google, and Google is part of the parent company Alphabet. The dataset contains various attributes that help us analyse why people upload videos at specific times and why viewers are engaged.
Task 2
The three dashboards show the contribution of individual countries to video uploads, the views per country, and the categories of videos available in those countries.
A few countries do not upload many videos but still contribute actively to reach. One such country is GB. Its contribution to uploads has been low, but its audience is active on YouTube: viewers watch the videos, like them and comment on them.
The fact that the audience watches the videos suggests that it likes the quality of the videos, the channels present in the country and the content. People may upload few videos because of government rules and regulations (as in North Korea, which is isolated from the rest of the world) or because of some other factor. In some countries the upload count is high while engagement is low. This suggests that the uploaded content is not up to the mark and does not match the audience's taste.
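The comparison between upload counts and audience engagement can be cross-checked outside Watson Analytics. The sketch below is only an illustration in Python and is not the method used in this report. It assumes the rows have already been loaded as dictionaries keyed by the data dictionary names (Publish_country, Views, Likes, Comments_count); the sample values and the helper name engagement_by_country are invented for the example.

```python
from collections import defaultdict

# Hypothetical sample rows; the values are invented for illustration only.
rows = [
    {"Publish_country": "GB", "Views": 900, "Likes": 90, "Comments_count": 30},
    {"Publish_country": "GB", "Views": 1100, "Likes": 110, "Comments_count": 50},
    {"Publish_country": "US", "Views": 100, "Likes": 5, "Comments_count": 1},
    {"Publish_country": "US", "Views": 100, "Likes": 5, "Comments_count": 1},
    {"Publish_country": "US", "Views": 100, "Likes": 5, "Comments_count": 1},
    {"Publish_country": "US", "Views": 100, "Likes": 5, "Comments_count": 1},
]


def engagement_by_country(rows):
    """Return uploads, views per upload and interactions per upload by country."""
    totals = defaultdict(lambda: {"uploads": 0, "views": 0, "interactions": 0})
    for row in rows:
        country = totals[row["Publish_country"]]
        country["uploads"] += 1
        country["views"] += row["Views"]
        # Likes plus comments is used here as a simple proxy for engagement.
        country["interactions"] += row["Likes"] + row["Comments_count"]
    return {
        name: {
            "uploads": t["uploads"],
            "views_per_upload": t["views"] / t["uploads"],
            "interactions_per_upload": t["interactions"] / t["uploads"],
        }
        for name, t in totals.items()
    }


summary = engagement_by_country(rows)
for name, stats in sorted(summary.items()):
    print(name, stats)
```

A country with few uploads but a high views-per-upload figure matches the GB pattern described above, while many uploads with a low figure matches the opposite case.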
Task 3:
The analysis uncovered a few insights:
· Over the years, only GB shows a major change in its upload pattern. Its upload count was low until 2017, when the country started uploading many more videos
· Canada is both an English-speaking and a French-speaking country. Its large French-speaking population may explain why a very popular English song by Ed Sheeran, which is popular in most parts of the world, is one of the least viewed videos there
· Friday has seen the highest number of uploads and Sunday the lowest. The reason may be that the weekend starts on Friday and ends on Sunday
· There are variations in the upload times too. Among the hours of the day, 4pm – 5pm has seen the maximum count of uploads. The number of uploads also varies from month to month. The variations differ since there are many...
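The day-of-week finding above can be reproduced with a simple frequency count. The following is a minimal sketch in Python, not the Watson Analytics workflow used for this report. It assumes a Publish_day_of_week field as named in the data dictionary; the sample rows and the helper name uploads_per_day are hypothetical.

```python
from collections import Counter

# Hypothetical sample rows; the real dataset has many more rows and columns.
rows = [
    {"Publish_day_of_week": "Friday"},
    {"Publish_day_of_week": "Friday"},
    {"Publish_day_of_week": "Friday"},
    {"Publish_day_of_week": "Wednesday"},
    {"Publish_day_of_week": "Wednesday"},
    {"Publish_day_of_week": "Sunday"},
]


def uploads_per_day(rows):
    """Count how many videos were published on each day of the week."""
    return Counter(row["Publish_day_of_week"] for row in rows)


counts = uploads_per_day(rows)
# Day with the most uploads and day with the fewest uploads.
busiest_day, busiest_count = counts.most_common(1)[0]
quietest_day, quietest_count = min(counts.items(), key=lambda item: item[1])

print("Most uploads:", busiest_day, busiest_count)
print("Fewest uploads:", quietest_day, quietest_count)
```

The same counting approach would apply to the hour of upload, provided the format of the Time_frame field is known.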