All versions - Academic Resource Search

4 results (0.03 seconds)

[HTML] Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge

A Beikmohammadi, S Magnússon - Information Sciences, 2024 - Elsevier

Despite the huge success of reinforcement learning (RL) in solving many difficult problems,
its Achilles heel has always been sample inefficiency. On the other hand, in RL, taking …

Cited by: 3

Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge

A Beikmohammadi, S Magnússon - 2024 - dl.acm.org

Despite the huge success of reinforcement learning (RL) in solving many difficult problems,
its Achilles heel has always been sample inefficiency. On the other hand, in RL, taking …

Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge

A Beikmohammadi, S Magnússon - Information Sciences, 2024 - diva-portal.org

Despite the huge success of reinforcement learning (RL) in solving many difficult problems,
its Achilles heel has always been sample inefficiency. On the other hand, in RL, taking …

[PDF] Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge

A Beikmohammadi, S Magnússon - Information Sciences, 2024 - researchgate.net

Despite the huge success of reinforcement learning (RL) in solving many difficult problems,
its Achilles heel has always been sample inefficiency. On the other hand, in RL, taking …
