Skip to main content

Gradient Boosting

1 selectedDifficulty 5-51 unseenView topic
IntermediateNew
0 answered
1 intermediateAdapts to your performance
Question 1 of 1
120sintermediate (5/10)conceptual
In gradient boosting with squared loss, each new tree is fit to the negative gradient of the loss, which equals the residuals. Why does boosting use the negative gradient rather than directly fitting the residuals for general loss functions?