Deep Reinforcement Learning

4 items

RESEARCHDEV.to AI·4/11/2026

End-to-End Deep Reinforcement Learning for Lane Keeping Assist

This content focuses on the application of end-to-end Deep Reinforcement Learning for the development of lane-keeping assist systems. The research explores the use of advanced AI to enhance vehicle safety and autonomy.

Deep Reinforcement Learning Machine Learning autonomous driving Lane Keeping Assist

RESEARCHarXiv CS.AI·4/13/2026

RAMP: Hybrid DRL for Online Learning of Numeric Action Models

RAMP proposes a novel strategy for learning numeric planning action models online through environmental interactions, integrating Deep Reinforcement Learning (DRL), action model learning, and planning. This creates a positive feedback loop where the RL policy gathers data to refine the action model, while the planner generates plans to continue training the RL policy.

Deep Reinforcement Learning Action Model Learning Numeric Planning reinforcement learning

RESEARCHarXiv CS.AI·4/6/2026

Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization

O artigo aborda a aplicação de Aprendizado por Reforço Profundo interpretável para a otimização do ciclo de vida de pontes em nível de elemento. Ele busca oferecer transparência e eficiência na gestão da infraestrutura.

Deep Reinforcement Learning Optimization interpretable AI Civil Engineering

RESEARCHarXiv CS.AI·4/7/2026

When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling

Este artigo de pesquisa explora o design adaptativo de recompensas para DRL no agendamento de satélites LEO, revelando um dilema de estabilidade onde pesos de recompensa estáticos superam os dinâmicos devido à necessidade de um sinal quase estacionário para o PPO. O estudo introduz um método de sondagem causal para identificar a alavancagem de termos de recompensa específicos, descobrindo que um aumento na penalidade de switching melhora significativamente a taxa de dados.

Deep Reinforcement Learning satellite scheduling Reward Design