Using CDC's Oncology SEER dataset attached below to find the total number of occurrences of various breast cancers separately for men and women in four age groups (ages 0-24; 25-49; 50-74;...

2 answer below »

Using CDC's Oncology SEER dataset attached below to find the total number of occurrences of various breast cancers separately for men and women in four age groups (ages 0-24; 25-49; 50-74; 75+) . Save the output in a .csv file with nine things per line separated by commas: the cancer type (remove commas from this), total number of occurrences in men aged 0-24, and total number of occurrences in women aged 0-24, and so on. Use names from ICD-O3 for cancer types. You should print only the cancer types whose codes are found in ICD-O3 and it need not be in any particular order.You need to read the SEER file and then read the ICD-03 file to update the cancer type (so the cancer name shows in the final output file).Submit your .py file in Canvas.




Character positions in SEER file starting from left:



Sex
24



Age at diagnosis
25-27



Year of birth
28-31



Histology Type ICD-O-3
53-56



Behavior Code ICD-O-3
57




780044150000001543501 620771927 02082005C50828520385203211 0003 0111000000001001000000100100098818010010111 0102000205500205102090 0 01 216 260001748C508 1161023 09980131111 022051 000000000010136999933011158 01241 18000010 99 8 0200 780050310000001543102 020421962 02072005C50818500285232411 0001 0200000000002002000000000001098805000000001 0101000205500205104090 0 01 209 260002330D059 9999992 09980132200 022071 260002600040099999933092258 01021 05000000 99 8 0200 780060120000001543202 020741931 02112005C50428500385003111 0001 0062000000001001000000000002098815000010221 0102000205500205102490 0 01 215 260001744C504 1161023 09980132202 022089 503005030040236999933011158 01071 15000010 99 8 0200 780061510000001543501 020811924 03082005C50418480384803111 0013 0061000000001001000000000000098815000010111 0102000205500205105190 0 02 217 260001744C504 1161023 08980131101 022117 500805008040136999933011158 00911 15000010 99 8 0300 780078410000001543502 020551949 02122005C50818500385003311 0095 0251000000001001009800000000098820000032111 0102000205500205109090 0 01 212 260001748C508 1161023 09980132201 022071 000000000010136999931011158 01201 20000032 99 8 0200 780083170000001543501 020971908 02112005C50928500385003211 9800 9999999999901001009898798700098899999999991 0102000205500205100090 1 01 218 260001749C509 1161023 09980131109 022117 220302203040936999900011158 00131 99999999 99 8 0200 780083680000001543302 020521953 02082005C50918500285002311 0001 9990000000002002000000000001098805000000001 0102000205500205104090 0 01 211 260002330D059 9999992 09980132200 022109 260002600040099999933092258 00811 05000000 99 8 0200 780086440000001543502 020921913 02072005C50828520385203911 9999 9999999999901001009998798798798899999999991 0102000205500205100090 1 01 218 260001748C508 1161023 09980132209 022071 501205012040936999919911158 00231 99999999 99 8 0200 780090850000001543501 020771928 02072005C50828500285002311 9800 9990000000099999909800000001098805000000001 0102000205500205104190 0 01 216 260002330D059 9999992 09980131100 022087 000000000010099999930094458 01251 05000000 99 8 0200 780104000000001543201 020651939 02072005C50428500385003111 0015 0061000000001001000000000005098815000010111 0102000205500205105190 0 01 214 260001744C504 1161023 09980131101 022105 000000000010136999933011158 01251 15000010 99 8 0200 780105310000001543902 020711933 02102005C50518500385003211 0007 0081000000001001000000000000098815000010111 0103000205500205104090 0 01 215 260001745C505 1161023 09980132201 022051 000000000010136999933011158 01221 15000010 99 8 0200 780257710000001543201 020621943 02082005C50928500285002211 9800 0100000000002002009800000001098805000000001 0101000205500205102290 0 02 213 260002330D059 9999992 09980131100 022105 000000000010099999930092258 01241 05000000 99 8 0200 780297870000001543102 020481956 02072005C50428500385003211 0013 0601000000002002000000000000098830000033111 0102000205500205105190 0 02 210 260001744C504 1161023 09980132201 022095 000000000010136999933002258 01251 30000033 99 8 0200 780417940000001543201 020731932 02112005C50918500385003211 0002 0231000000001001000000000005098820000032111 0102000205500205104290 0 02 215 260001749C509 1161023 09980131101
Answered 11 days AfterOct 31, 2022

Answer To: Using CDC's Oncology SEER dataset attached below to find the total number of occurrences of various...

Robert answered on Nov 12 2022
41 Votes
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here