A new approach to training neural networks using natural gradient descent with momentum based on Dirichlet distributions
In this paper, we propose a natural gradient descent algorithm with momentum based on Dirichlet distributions to speed up the training of neural networks. This approach takes into account not only the direction of the gradients, but also the convexity of the minimized function, which significantly a...
Сохранить в:
| Главные авторы: | , , , |
|---|---|
| Формат: | Статья |
| Язык: | English |
| Опубликовано: |
2023
|
| Темы: | |
| Online-ссылка: | https://dspace.ncfu.ru/handle/20.500.12258/23476 |
| Метки: |
Добавить метку
Нет меток, Требуется 1-ая метка записи!
|