Publication: Memristor-based spiking neural network with online reinforcement learning
| dc.contributor.author | Vlasov, D. | |
| dc.contributor.author | Minnekhanov, A. | |
| dc.contributor.author | Rybka, R. | |
| dc.contributor.author | Davydov, Y. | |
| dc.contributor.author | Sboev, A. G. | |
| dc.contributor.author | Сбоев, Александр Георгиевич | |
| dc.date.accessioned | 2024-12-27T08:50:56Z | |
| dc.date.available | 2024-12-27T08:50:56Z | |
| dc.date.issued | 2023 | |
| dc.description.abstract | Neural networks implemented in memristor-based hardware can provide fast and efficient in-memory computation, but traditional learning methods such as error back-propagation are hardly feasible in it. Spiking neural networks (SNNs) are highly promising in this regard, as their weights can be changed locally in a self-organized manner without the demand for high-precision changes calculated with the use of information almost from the entire network. This problem is rather relevant for solving control tasks with neural-network reinforcement learning methods, as those are highly sensitive to any source of stochasticity in a model initialization, training, or decision-making procedure. This paper presents an online reinforcement learning algorithm in which the change of connection weights is carried out after processing each environment state during interaction-with-environment data generation. Another novel feature of the algorithm is that it is applied to SNNs with memristor-based STDP-like learning rules. The plasticity functions are obtained from real memristors based on poly-p-xylylene and CoFeB-LiNbO | |
| dc.format.extent | С. 512-523 | |
| dc.identifier.citation | Memristor-based spiking neural network with online reinforcement learning / Vlasov, D. [et al.] // Neural Networks. - 2023. - 166. - P. 512-523. - 10.1016/j.neunet.2023.07.031 | |
| dc.identifier.doi | 10.1016/j.neunet.2023.07.031 | |
| dc.identifier.uri | https://www.doi.org/10.1016/j.neunet.2023.07.031 | |
| dc.identifier.uri | https://www.scopus.com/record/display.uri?eid=2-s2.0-85167573829&origin=resultslist | |
| dc.identifier.uri | http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:001066065400001 | |
| dc.identifier.uri | https://openrepository.mephi.ru/handle/123456789/29148 | |
| dc.relation.ispartof | Neural Networks | |
| dc.subject | Memristor | |
| dc.subject | Initialization | |
| dc.subject | Neuromorphic engineering | |
| dc.subject | Benchmark (surveying) | |
| dc.subject | Spiking Neurons | |
| dc.subject | Working Memory | |
| dc.subject | Brain-inspired Computing | |
| dc.subject | Resistive Switching | |
| dc.subject | Neuromorphic Computing | |
| dc.title | Memristor-based spiking neural network with online reinforcement learning | |
| dc.type | Article | |
| dspace.entity.type | Publication | |
| oaire.citation.volume | 166 | |
| relation.isAuthorOfPublication | fc2d63d7-5260-41ba-a952-0420c8848b13 | |
| relation.isAuthorOfPublication.latestForDiscovery | fc2d63d7-5260-41ba-a952-0420c8848b13 | |
| relation.isOrgUnitOfPublication | ba0b4738-e6bd-4285-bda5-16ab2240dbd1 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | ba0b4738-e6bd-4285-bda5-16ab2240dbd1 |