Stochastic gradient descent (SGD) updates each parameter by subtracting the gradient of the loss with respect to that parameter, scaled by the learning rate η, a hyperparameter. If η is too large, SGD can diverge; if it is too small, convergence is slow. The update rule is simply

θ_{t+1} = θ_t − η ∇L(θ_t)
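As a minimal sketch, the update rule translates directly into code. The function name sgd_step, the grad_fn callback, and the toy quadratic loss below are illustrative assumptions, not part of the original post.

```python
import numpy as np

def sgd_step(theta, grad_fn, lr=0.01):
    """One SGD update: theta <- theta - lr * grad(L)(theta)."""
    return theta - lr * grad_fn(theta)

# Toy example: minimize L(theta) = ||theta||^2, whose gradient is 2*theta.
theta = np.array([3.0, -2.0])
grad_fn = lambda t: 2.0 * t

for step in range(100):
    theta = sgd_step(theta, grad_fn, lr=0.1)

print(theta)  # converges toward [0, 0]
```

With lr=0.1 the iterate shrinks by a factor of 0.8 per step and converges; setting lr above 1.0 for this loss would make each step overshoot and diverge, matching the caveat about η above.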