CSI 5810 (Assignment#2)1. Thefolder“CSI5810TextFiles” postedonMoodlecontains8textfiles.Youaretoapplytext-processingstepsincludingstopwordfilteringtoobtaintermdocumentmatrix...

1 answer below »
CSI 5810 (Assignment#2)
1. Thefolder“CSI5810TextFiles” postedonMoodlecontains8textfiles.Youare

toapplytext-processingstepsincludingstopwordfilteringtoobtaintermdocumentmatrix underBooleanModel. Usingthismatrix,calculate

similaritybetweenalldocumentpairsandshowyourresultsintheformof

an 8x8matrix. UseJaccard’ssimilaritymeasure.
2. ThisisacontinuationofExercise#1.Inthiscase,determinethevectorspace

representationforeachdocumentandcalculatethe8x8documentsimilarity

matrixusingCosinemeasureofsimilarity.
3. Inthisexercise,youwilluse“WheatData”postedatMoodle.Thedata

consistsof32 trainingexampleseachfromthree classes.Usingthesetraining

examples,youwillperformclassificationof3 testexamplesbyk-NN

classification (k=1,3,and5), weightedk-NN(3and5) andbyNaïveBayes

classifier.YouwillwriteyourowncodetoimplementNaiveBayes.Compare

andcommentonyourresults.
4. Inthisexercise,youwillagainuse32trainingexamplesofwheatdataand

projecttheminto two-dimensionsusingtheFisher’sLDAmethodfor

multipleclasses.Next,youwillapplyPCAonthesame32examplestoreduce

thedatatotwodimensions.Youwillshowyourresultbycreatingtwoscatter

plots,oneforLDAandtheotherforPCA.Makesure tocolorcodetheproject

pointswiththeirrespectiveclasslabels.


Sheet1 feat1feat2feat3feat4feat5feat6feat7Class 15.2614.840.8715.7633.3122.2215.221 14.8814.570.88115.5543.3331.0184.9561 14.2914.090.9055.2913.3372.6994.8251 13.8413.940.89555.3243.3792.2594.8051 16.1414.990.90345.6583.5621.3555.1751 14.3814.210.89515.3863.3122.4624.9561 14.6914.490.87995.5633.2593.5865.2191 14.1114.10.89115.423.3022.75.0011 16.6315.460.87476.0533.4652.045.8771 16.4415.250.8885.8843.5051.9695.5331 15.2614.850.86965.7143.2424.5435.3141 14.0314.160.87965.4383.2011.7175.0011 13.8914.020.8885.4393.1993.9864.7381 13.7814.060.87595.4793.1563.1364.8721 13.7414.050.87445.4823.1142.9324.8251 14.5914.280.89935.3513.3334.1854.7811 13.9913.830.91835.1193.3835.2344.7811 15.6914.750.90585.5273.5141.5995.0461 14.714.210.91535.2053.4661.7674.6491 12.7213.570.86865.2263.0494.1024.9141 14.1614.40.85845.6583.1293.0725.1761 14.1114.260.87225.523.1682.6885.2191 15.8814.90.89885.6183.5070.76515.0911 12.0813.230.86645.0992.9361.4154.9611 15.0114.760.86575.7893.2451.7915.0011 16.1915.160.88495.8333.4210.9035.3071 13.0213.760.86415.3953.0263.3734.8251 12.7413.670.85645.3952.9562.5044.8691 14.1114.180.8825.5413.2212.7545.0381 13.4514.020.86045.5163.0653.5315.0971 13.1613.820.86625.4542.9750.85515.0561 15.4914.940.87245.7573.3713.4125.2281 17.6315.980.86736.1913.5614.0766.062 16.8415.670.86235.9983.4844.6755.8772 17.2615.730.87635.9783.5944.5395.7912 19.1116.260.90816.1543.932.9366.0792 16.8215.510.87866.0173.4864.0045.8412 16.7715.620.86385.9273.4384.925.7952 17.3215.910.85996.0643.4033.8245.9222 20.7117.230.87636.5793.8144.4516.4512 18.9416.490.8756.4453.6395.0646.3622 17.1215.550.88925.853.5662.8585.7462 16.5315.340.88235.8753.4675.5325.882 18.7216.190.89776.0063.8575.3245.8792 20.216.890.88946.2853.8645.1736.1872 19.5716.740.87796.3843.7721.4726.2732 19.5116.710.8786.3663.8012.9626.1852 18.2716.090.8876.1733.6512.4436.1972 18.8816.260.89696.0843.7641.6496.1092 18.9816.660.8596.5493.673.6916.4982 21.1817.210.89896.5734.0335.786.2312 20.8817.050.90316.454.0325.0166.3212 20.116.990.87466.5813.7851.9556.4492 18.7616.20.89846.1723.7963.126.0532 18.8116.290.89066.2723.6933.2376.0532 18.5916.050.90666.0373.866.0015.8772 18.3616.520.84526.6663.4854.9336.4482 16.8715.650.86486.1393.4633.6965.9672 19.3116.590.88156.3413.813.4776.2382 18.9816.570.86876.4493.5522.1446.4532 18.1716.260.86376.2713.5122.8536.2732 18.7216.340.8816.2193.6842.1886.0972 16.4115.250.88665.7183.5254.2175.6182 17.9915.860.89925.893.6942.0685.8372 13.0713.920.8485.4722.9945.3045.3953 13.3213.940.86135.5413.0737.0355.443 13.3413.950.8625.3893.0745.9955.3073 12.2213.320.86525.2242.9675.4695.2213 11.8213.40.82745.3142.7774.4715.1783 11.2113.130.81675.2792.6876.1695.2753 11.4313.130.83355.1762.7192.2215.1323 12.4913.460.86585.2672.9674.4215.0023 12.713.710.84915.3862.9113.265.3163 10.7912.930.81075.3172.6485.4625.1943 11.8313.230.84965.2632.845.1955.3073 12.0113.520.82495.4052.7766.9925.273 12.2613.60.83335.4082.8334.7565.363 11.1813.040.82665.222.6933.3325.0013 11.3613.050.83825.1752.7554.0485.2633 11.1913.050.82535.252.6755.8135.2193 11.3412.870.85965.0532.8493.3475.0033 12.1313.730.80815.3942.7454.8255.223 11.7513.520.80825.4442.6784.3785.313 11.4913.220.82635.3042.6955.3885.313 12.5413.670.84255.4512.8793.0825.4913 12.0213.330.85035.352.814.2715.3083 12.0513.410.84165.2672.8474.9885.0463 12.5513.570.85585.3332.9684.4195.1763 11.1412.790.85585.0112.7946.3885.0493 12.113.150.87935.1052.9412.2015.0563 12.4413.590.84625.3192.8974.9245.273 12.1513.450.84435.4172.8373.6385.3383 11.3513.120.82915.1762.6684.3375.1323 11.24130.83595.092.7153.5215.0883 11.02130.81895.3252.7016.7355.1633 11.5513.10.84555.1672.8456.7154.9563 test samples 13.213.660.88835.2363.2328.3155.056? 16.2315.180.8855.8723.4723.7695.922? 12.7313.750.84585.4122.8823.5335.067?
Answered 1 days AfterOct 10, 2022

Answer To: CSI 5810 (Assignment#2)1. Thefolder“CSI5810TextFiles”...

Sathishkumar answered on Oct 12 2022
54 Votes
Sheet1
    feat1    feat2    feat3    feat4    feat5    feat6    feat7
    13.2    13.66    0.8883    5.236    3.232    8.315    5.056
    16.
23    15.18    0.885    5.872    3.472    3.769    5.922
    12.73    13.75    0.8458    5.412    2.882    3.533    5.067
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here