Publication:
ENSURING SAFETY WHILE ENHANCING PERFORMANCE: ENCOURAGING REINFORCEMENT LEARNING BY ADDRESSING CONSTRAINTS AND UNCERTAINTY

creativeworkseries.issn 2074-7128 (Print)
dc.contributor.authorAghbolagh, M. A.
dc.date.accessioned2024-10-03T12:31:01Z
dc.date.available2024-10-03T12:31:01Z
dc.date.issued2024
dc.description.abstractStriking a balance between safety and performance remains a critical concern, despite advancements in the field. To address this issue, a versatile framework named Safety Goes Along with Performance (SGAWP) is proposed, centered on off-policy algorithms grounded in value function optimization. SGAWP utilizes reinforcement learning to navigate the data space, emphasizing high task performance while addressing risks (such as undesirable states) by incorporating safety costs into the value function. By integrating uncertainty management and task performance constraints, SGAWP aims to achieve improved safety performance alongside respectable task performance. Moreover, SGAWP leverages curiosity-driven exploration to expand the data space and employs task policies to enhance safety policy performance. As a result, SGAWP enhances safety performance with minimal loss in task performance. Beyond its success in reinforcement learning, SGAWP holds promise for applications like autonomous driving, where safety is paramount. Through rigorous experimentation across various offpolicy algorithms, SGAWP demonstrates robust generalization and achieves its objectives effectively
dc.identifier.citationAGHBOLAGH, Mohsen Abdollahzadeh. ENSURING SAFETY WHILE ENHANCING PERFORMANCE: ENCOURAGING REINFORCEMENT LEARNING BY ADDRESSING CONSTRAINTS AND UNCERTAINTY. IT Security (Russia), [S.l.], v. 31, no. 2, p. 90–110, 2024. ISSN 2074-7136. URL: https://bit.spels.ru/index.php/bit/article/view/1635. DOI: http://dx.doi.org/10.26583/bit.2024.2.06.
dc.identifier.doi10.26583/bit.2024.2.06
dc.identifier.urihttps://openrepository.mephi.ru/handle/123456789/15495
dc.identifier.urihttps://bit.spels.ru/index.php/bit/article/view/1635
dc.identifier.urihttp://dx.doi.org/10.26583/bit.2024.2.06
dc.publisherНИЯУ МИФИ
dc.subjectRisk assessment
dc.subjectExploration
dc.subjectOff-policy
dc.subjectSafe reinforcement learning constraint
dc.titleENSURING SAFETY WHILE ENHANCING PERFORMANCE: ENCOURAGING REINFORCEMENT LEARNING BY ADDRESSING CONSTRAINTS AND UNCERTAINTY
dc.typeArticle
dspace.entity.typePublication
journal.titleБезопасность информационных технологий
journalvolume.identifier.nameБезопасность информационных технологий
relation.isJournalIssueOfPublicationebb6a468-d413-41c6-9f27-91e67a195035
relation.isJournalIssueOfPublication.latestForDiscoveryebb6a468-d413-41c6-9f27-91e67a195035
relation.isJournalOfPublication3b9ae913-eaeb-4d29-a767-7f6ca8a0e066
Файлы
Original bundle
Теперь показываю 1 - 1 из 1
Загружается...
Уменьшенное изображение
Name:
1635-2501-2-PB.pdf
Size:
1.46 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Теперь показываю 1 - 1 из 1
Загружается...
Уменьшенное изображение
Name:
license.txt
Size:
3.45 KB
Format:
Item-specific license agreed to upon submission
Description:
Коллекции