Order-preserving gflownets

文章

学术资源搜索

获得 5 条结果（用时0.02秒）

我的图书馆

在引用文章中搜索

[PDF] arxiv.org

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

F Yu, L Jiang, H Kang, S Hao, L Qin - arXiv preprint arXiv:2406.05673, 2024 - arxiv.org

Divergent thinking, the cognitive process of generating diverse solutions, is a hallmark of
human creativity and problem-solving. For machines, sampling diverse solution trajectories …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Rectifying Reinforcement Learning for Reward Matching

H He, E Bengio, Q Cai, L Pan - arXiv preprint arXiv:2406.02213, 2024 - arxiv.org

The Generative Flow Network (GFlowNet) is a probabilistic framework in which an agent
learns a stochastic policy and flow functions to sample objects with probability proportional …

Generative Flow Networks: Theory and Applications to Structure Learning

T Deleu - arXiv preprint arXiv:2501.05498, 2025 - arxiv.org

Without any assumptions about data generation, multiple causal models may explain our
observations equally well. To avoid selecting a single arbitrary model that could result in …

[PDF] arxiv.org

Improving GFlowNets with Monte Carlo Tree Search

N Morozov, D Tiapkin, S Samsonov, A Naumov… - arXiv preprint arXiv …, 2024 - arxiv.org

Generative Flow Networks (GFlowNets) treat sampling from distributions over compositional
discrete spaces as a sequential decision-making problem, training a stochastic policy to …

被引用次数：1 相关文章

[PDF] arxiv.org

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

R Hu, Y Zhang, Z Li, L Huang - arXiv preprint arXiv:2410.02596, 2024 - arxiv.org

Generative Flow Networks (GFlowNets) are a novel class of generative models designed to
sample from unnormalized distributions and have found applications in various important …

高级搜索

QQ 群