99爱在线视频这里只有精品_窝窝午夜看片成人精品_日韩精品久久久毛片一区二区_亚洲一区二区久久

合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

代寫MS6711、代做Python語言程序
代寫MS6711、代做Python語言程序

時間:2025-03-07  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



MS6711 Data Mining
Homework 2
Instruction
This homework contains both coding and non-coding questions. Please submit two files,
1. One word or pdf document of answers and plots of ALL questions without coding details.
2. One jupyter notebook of your codes.
3. Questions 1 and 2 are about concepts, 3 - 6 are about coding.
1
Problem 1 [20 points]
We perform best subset, forward stepwise and backward stepwise selection on the same dataset with p
predictors. For each approach, we obtain p + 1 models containing 0, 1, 2, · · · , p predictors. Explain your
answer.
1. Which of the three models with same number of k predictors has smallest training RSS?
2. Which of the three models with same number of k predictors has smallest testing RSS? (best
subset, forward, backward, or cannot determine?)
3. True or False: The predictors in the k-variable model identified by forward stepwise are a subset of
the predictors in the (k + 1)-variable model identified by forward stepwise selection.
4. True or False: The predictors in the k-variable model identified by best subset are a subset of the
predictors in the (k + 1)-variable model identified by best subset selection.
5. True or False: The lasso, relative to OLS, is less flexible and hence will give improved prediction
accuracy when its increase in bias is less than its decrease in variance.
2
Problem 2 [20 points]
Suppose we estimate Lasso by minimizing
||Y − Xβ||2
2 + λ||β||1
for a particular value of λ. For part 1 to 5, indicate which of (a) to (e) is correct and explain your answer.
1. As we increase λ from 0, the training RSS will
(a) Increase initially, and then eventually start decreasing in an inverted U shape.
(b) Decrease initially, and then eventually start increasing in a U shape.
(c) Steadily increase.
(d) Steadily decrease.
(e) Remain constant.
2. Repeat 1. for test RSS.
3. Repeat 1. for variance.
4. Repeat 1. for (squared) bias.
3
Problem 3 [20 points]
These data record the level of atmospheric ozone concentration from eight daily meteorological mea surements made in the Los Angeles basin in 1976. We have the 330 complete cases1. We want to find
climate/weather factors that impact ozone readings. Ozone is a hazardous byproduct of burning fossil
fuels and can harm lung function. The data set for this problem is:
Variable name Definition
ozone Long Maximum Ozone
vh Vandenberg 500 mb Height
wind Wind speed (mph)
humidity Humidity (%)
temp Sandburg AFB Temperature
ibh Inversion Base Height
dpg Daggot Pressure Gradint
ibt Inversion Base Temperature
vis Visibility (miles)
doy Day of the Year
[Note: I would recommend you use R for this question, since python does not have package for
forward / backward selection. See the code example on Canvas. Or you may use the sample python code
I provided.]
1. Report result of linear regression using all variables. Note that ozone is the response variable to
predict. What variables are significant?
2. Report the selected variables using the following model selection approaches.
(a) All subset selection.
(b) Forward stepwise
(c) Backward stepwise
3. Compare the outcome of these methods with the significant variables found in the full linear regres sion in question 1.
4. Potentially, other transformation of covariates might be important. What happens if you do all
subset selection using both the original variables and their square? That is, for all variables, include
4
both
X, X2
in the linear regression model for all subset selection.
5
Problem 4 [20 points]
In this exercise, we will predict the number of applications received using the other variables in the College
data set.
Private Public/private school indicator
Apps Number of applications received
Accept Number of applicants accepted
Enroll Number of new students enrolled
Top10perc New students from top 10% of high school class
Top25perc 1 = New students from top 25 % of high school class
F.Undergrad Number of full-time undergraduates
P.Undergrad Number of part-time undergraduates
Outstate Out-of-state tuition
Room.Board Room and board costs
Books Estimated book costs
Personal Estimated personal spending
PhD Percent of faculty with Ph.D.
Terminal Percent of faculty with terminal degree
S.F.Ratio Student faculty ratio
perc.alumni Percent of alumni who donate
Expend Instructional expenditure per student
Grad.Rate Graduation rate
1. Split the data set into a training set and a test set.
2. Fit a linear regression model using OLS on the training set, and report the test error obtained.
3. Fit a ridge regression model on the training set, with λ chosen by cross-validation. Report the test
error obtained.
4. Fit a lasso model on the training set, with λ chosen by cross-validation. Report the test error
obtained, along with the number of non-zero coefficient estimates.
5. Fit a PCR model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of M selected by cross-validation.
6. Fit a PLS model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of number of components selected by cross-validation.
6
Problem 5 [20 points]
We will now try to predict per capita crime rate in the Boston data set.
crim per capita crime rate by town.
zn proportion of residential land zoned for lots over 25,000 sq.ft.
indus proportion of non-retail business acres per town.
chas Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox nitrogen oxides concentration (parts per 10 million).
rm 1 = average number of rooms per dwelling.
age proportion of owner-occupied units built prior to 1940.
dis weighted mean of distances to five Boston employment centres.
rad index of accessibility to radial highways.
tax full-value property-tax rate per $10,000.
ptratio pupil-teacher ratio by town.
black 1000(Bk − 0.63)2 where Bk is the proportion of blacks by town.
lstat lower status of the population (percent).
medv median value of owner-occupied homes in $1000s.
1. Try out some of the regression methods explored in this chapter, such as best subset selection, the
lasso, ridge regression, PCR and partial least squares. Present and discuss results for the approaches
that you consider.
2. Propose a model (or set of models) that seem to perform well on this data set, and justify your
answer. Make sure that you are evaluating model performance using validation set error, cross validation, or some other reasonable alternative, as opposed to using training error.
3. Does your chosen model involve all of the features in the data set? Why or why not?
7
Problem 6 [20 points]
In a bike sharing system the process of obtaining membership, rental, and bike return is automated
via a network of kiosk locations throughout a city. In this problem, you will try to combine historical
usage patterns with weather data to forecast bike rental demand in the Capital Bikeshare program in
Washington, D.C.
You are provided hourly rental data collected from the Capital Bikeshare system spanning two years.
The file Bike train.csv, as the training set, contains data for the first 19 days of each month, while
Bike test.csv, as the test set, contains data from the 20th to the end of the month. The dataset includes
the following information:
daylabel day number ranging from 1 to 731
year, month, day, hour hourly date
season 1=spring,2=summer,3=fall,4=winter
holiday whether the day is considered a holiday
workingday whether the day is neither a weekend nor a holiday
weather 1 = clear, few clouds, partly cloudy
2 = mist + cloudy, mist + broken clouds, mist + few clouds, mist
3 = light snow, light rain + thunderstorm + scattered clouds, light rain
4 = 4 = heavy rain + ice pallets + thunderstorm + mist, snow + fog
temp temperature in Celsius
atemp ’feels like’ temperature in Celsius
humidity relative humidity
wind speed wind speed
count number of total rentals, outcome variable to predict
Predictions will be evaluated using the root mean squared error (RMSE), calculated as
RMSE =
v
u
u t
n
1
nX
i=1
(yi − ybi)
2
where yi
is the true count, ybi
is the prediction, and n is the number of entries to be evaluated.
Build a model on train dataset to predict the bikeshare counts for the hours recorded in the test
dataset. Report your prediction RMSE on testing set.
Some tips
• This is a relatively open question, you may use any model you learnt from this class.
8
• It will be helpful to examine the data graphically to spot any seasonal pattern or temporal trend.
• There is one day in the training data with weird atemp record and another day with abnormal
humidity. Find those rows and think about what you want to do with them. Is there anything
unusual in the test data?
• It might be helpful to transform the count to log(count + 1). If you did that, do not forget to
transform your predicted values back to count.
• Think about how you would include each predictor into the model, as continuous or as categorical?
• Is there any transformation of the predictors or interactions between them that you think might be
helpful?
Try to summarize your exploration of the data, and modeling process. You may fit a few models and
chose one from them. You will receive points based on your write-up and test RMSE. This is not a
competition among the class to achieve the minimal RMSE, but your result should be in a reasonable
range.


請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp



 

掃一掃在手機打開當前頁
  • 上一篇:INT5051代做、代寫Python編程設計
  • 下一篇:代寫COMP3334、代做C/C++,Python編程
  • 無相關信息
    合肥生活資訊

    合肥圖文信息
    急尋熱仿真分析?代做熱仿真服務+熱設計優化
    急尋熱仿真分析?代做熱仿真服務+熱設計優化
    出評 開團工具
    出評 開團工具
    挖掘機濾芯提升發動機性能
    挖掘機濾芯提升發動機性能
    海信羅馬假日洗衣機亮相AWE  復古美學與現代科技完美結合
    海信羅馬假日洗衣機亮相AWE 復古美學與現代
    合肥機場巴士4號線
    合肥機場巴士4號線
    合肥機場巴士3號線
    合肥機場巴士3號線
    合肥機場巴士2號線
    合肥機場巴士2號線
    合肥機場巴士1號線
    合肥機場巴士1號線
  • 短信驗證碼 豆包 幣安下載 AI生圖 目錄網

    關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
    ICP備06013414號-3 公安備 42010502001045

    99爱在线视频这里只有精品_窝窝午夜看片成人精品_日韩精品久久久毛片一区二区_亚洲一区二区久久

          9000px;">

                在线免费观看成人短视频| 五月婷婷综合网| 色综合久久久久综合| 久久精品国产精品亚洲红杏| 亚洲大尺度视频在线观看| 国产精品久久久久久久久晋中| 欧美一区二区三区影视| 在线观看欧美日本| 91视频精品在这里| 成人免费av网站| 不卡的电视剧免费网站有什么| 国产成人亚洲综合a∨婷婷| 裸体在线国模精品偷拍| 美日韩一区二区| 亚洲影视在线播放| 日韩欧美激情一区| 精品乱人伦小说| 欧美大片一区二区| 精品嫩草影院久久| 久久午夜色播影院免费高清| 久久久久久久久久久电影| 精品国产三级电影在线观看| 2019国产精品| 亚洲美女视频在线| 亚洲妇熟xx妇色黄| 久久国产乱子精品免费女| 国产高清不卡二三区| 成人av免费在线播放| 在线欧美小视频| 欧美日韩国产高清一区二区三区| 欧美精品xxxxbbbb| 久久久久99精品国产片| 国产精品久久久久久久岛一牛影视 | 国产女人水真多18毛片18精品视频| 国产亚洲视频系列| 亚洲毛片av在线| 天堂久久久久va久久久久| 国产永久精品大片wwwapp| 成人高清免费观看| 欧美在线影院一区二区| 日韩欧美一级二级三级| 亚洲国产精品成人综合| 一区二区三区在线免费| 青青草国产精品亚洲专区无| 国产福利精品一区| 欧美体内she精高潮| 久久人人超碰精品| 一区二区三区四区不卡在线| 捆绑调教一区二区三区| 99re免费视频精品全部| 欧美电影免费观看完整版| 国产精品不卡一区| 蜜臀国产一区二区三区在线播放| 丰满少妇在线播放bd日韩电影| 精品视频免费在线| 久久久99久久| 日韩av中文在线观看| 成人深夜在线观看| 日韩免费高清视频| 亚洲综合色网站| 国产精品资源网站| 91精品国产麻豆国产自产在线 | 成人福利电影精品一区二区在线观看| 欧美亚州韩日在线看免费版国语版| 欧美成人一级视频| 一区二区久久久| 国产99久久久久| 精品国产一区久久| 亚洲mv大片欧洲mv大片精品| 成人av网址在线| 日韩一级大片在线| 天天操天天色综合| 91在线视频观看| 国产成人精品一区二| 在线观看一区日韩| 亚洲色图欧美偷拍| 东方aⅴ免费观看久久av| 亚洲欧洲色图综合| 91成人免费在线视频| 蜜桃视频第一区免费观看| 欧美一级高清大全免费观看| 国产精品自产自拍| 亚洲18女电影在线观看| 欧美一区二区三区在线观看 | 欧美麻豆精品久久久久久| 一区二区欧美视频| 99riav一区二区三区| 亚洲国产精品久久一线不卡| 亚洲另类中文字| 国产精品免费视频网站| 国产精品视频九色porn| 国产精品久久久久久久久动漫| 国产视频视频一区| 亚洲国产sm捆绑调教视频| 蜜臀精品一区二区三区在线观看 | 亚洲国产精品ⅴa在线观看| 日韩免费一区二区三区在线播放| 91黄色免费网站| 国产婷婷色一区二区三区四区| 国产精品久久精品日日| 91精品午夜视频| 亚洲精品在线三区| 韩国v欧美v亚洲v日本v| 欧美伊人精品成人久久综合97| 精品在线一区二区| 国产精品白丝av| 精品粉嫩超白一线天av| 免费一级欧美片在线观看| 一区二区三区精品视频| 在线观看日产精品| 美女视频黄久久| 欧美人牲a欧美精品| 久久se精品一区精品二区| 久久久综合精品| 99re66热这里只有精品3直播 | 亚洲va国产天堂va久久en| 91蜜桃免费观看视频| 亚洲动漫第一页| 日韩精品一区二区三区在线观看| 国产一区二区三区观看| 欧美激情综合五月色丁香小说| 成人午夜大片免费观看| 国产精品久久久久久亚洲伦 | 91在线视频播放| 香蕉影视欧美成人| 日韩欧美国产三级| 99精品欧美一区二区三区小说| 亚洲在线一区二区三区| 欧美老人xxxx18| 丰满岳乱妇一区二区三区| 亚洲超丰满肉感bbw| 久久精品亚洲一区二区三区浴池 | 日韩avvvv在线播放| 精品福利在线导航| 成人一区二区三区视频 | 欧美一区二区三区影视| 99热精品一区二区| 麻豆成人综合网| 亚洲免费成人av| 久久蜜桃香蕉精品一区二区三区| 欧美视频一区在线观看| 国产**成人网毛片九色 | 欧美精品一区二区精品网| 色综合天天综合在线视频| 精东粉嫩av免费一区二区三区| 一区二区三区四区在线播放| 久久久青草青青国产亚洲免观| 色婷婷激情久久| 国产精一品亚洲二区在线视频| 亚洲国产欧美在线人成| 国产精品乱人伦一区二区| 欧美日韩国产精选| 日本精品一级二级| 99久久精品免费看| 国产九九视频一区二区三区| 日韩福利视频导航| 五月天激情综合| 亚洲免费在线播放| 国产精品久久久久精k8 | 日韩欧美在线影院| 欧美久久久久久蜜桃| 日本大香伊一区二区三区| 国产v日产∨综合v精品视频| 亚洲大片精品永久免费| 亚洲国产日韩a在线播放| 国产精品成人网| 中文字幕亚洲在| 亚洲精品一线二线三线无人区| 欧美日韩视频在线第一区 | 国产91丝袜在线播放| 免费成人小视频| 免费一区二区视频| 国产一区二区在线免费观看| 狠狠色2019综合网| 国产一区二区三区香蕉| 午夜久久久久久久久| 亚洲制服丝袜在线| 夜夜嗨av一区二区三区网页 | 国产麻豆成人精品| 国产精品亚洲一区二区三区妖精 | av激情综合网| 激情深爱一区二区| 久久精品国产澳门| 中文字幕五月欧美| 欧美激情综合在线| 国产精品欧美精品| 欧美日本精品一区二区三区| 欧美日韩精品二区第二页| 色哦色哦哦色天天综合| a在线播放不卡| 欧美日韩一区二区电影| 欧美日韩国产区一| 欧美精选在线播放| 欧美日韩中文字幕一区| 日韩精品专区在线影院重磅| 日韩欧美电影在线| 久久亚洲捆绑美女| 夜夜精品浪潮av一区二区三区| 亚洲mv在线观看| 六月丁香综合在线视频|