Xiaohei's Blog
headpicBlur image
Rating Criteria
Mediocre paper
Regular paper
Widely recognized or eye-catching paper
Foundational paper or one I really love
Undoubtedly the best paper

RL-BASED DRONE PAPER READING

Planar Pose Graph#

Arxiv ID 2209.08543
幻觉翻译 2209.08543

A stability framework for planar pose optimization

A stability framework for planar pose optimization. By decoupling the problem into two subproblems and analyzing them, it converts the original matrix-form problem into a linear one (simpler than the matrix formulation), and finally optimizes the target by solving the linear problem.

推荐指数:

Whole-Body Scale Optimization#

Arxiv ID 2208.06331
幻觉翻译 2208.06331

Accurate whole-body collision trajectory prediction with linear scaling

show It proposes an accurate whole-body collision formulation with linear scaling, and derives analytic gradients for trajectory optimization.

推荐指数:

UAV Payload Transportation#

Arxiv ID 2310.15050
幻觉翻译 2310.15050

Planning and control for complex structures

show1 Control of the UAV attitude and the payload attitude during heavy-load transportation, enabling automatic swing suppression.

推荐指数:

Robotic Relative Localization#

Arxiv ID 2210.08265
幻觉翻译 2210.08265

Swarm localization

show2 Relative localization for a robot swarm with mutual observations. It addresses recovering relative poses for partially mutually observed robot groups and proposes a robust, scalable algorithm.

推荐指数:

Path Planning#

Arxiv ID 2209.13159
幻觉翻译 2209.13159

Path planning

show3 Topic: improve view-path planning efficiency for autonomous implicit reconstruction and enhance reconstruction quality of target images. Methods: (1) approximate information-gain fields (2) hybrid representations (3) a new path-planning strategy.

推荐指数:

GS-Planner#

Arxiv ID 2405.10142
幻觉翻译 2405.10142

Gaussian-Splatting-based Planning Framework

show4 3D Gaussian reconstruction (mapping + path planning, with emphasis on mapping): it proposes complete and mapping-quality evaluation metrics for 3D Gaussian mapping, and designs a sampling-based active view-planning algorithm to guide reconstruction of unobserved regions and improve mapping quality.

推荐指数:

Back to Newton’s Laws#

Arxiv ID 2407.10648
幻觉翻译 2407.10648

Vision-based optimal control

show5 This is not a paper in the reinforcement learning scope. As its title suggests, it goes back to Newton’s laws and uses physical approaches to achieve high-speed UAV flight, still based on vision algorithms.

推荐指数:

ARiADNE#

Arxiv ID 2301.11575
幻觉翻译 2301.11575

Attention-based reinforcement learning method

show6 ARiADNE is an attention-based RL method for autonomous exploration in unknown robot environments, addressing short-sighted planning. Key contribution: use an attention network to learn multi-scale spatial dependencies in maps, combined with SAC to achieve non-myopic path decisions. It outperforms frontier-based methods, sampling-based methods, and CNN-DRL baselines in exploration efficiency, and is validated for practicality in ROS simulation.

推荐指数:

CTSAC#

Arxiv ID 2503.14254
幻觉翻译 2503.14254

Transformer + SAC with periodic-review curriculum learning and LiDAR clustering optimization

show7 To address weak environment reasoning, slow convergence, and hard sim-to-real transfer in goal-directed autonomous robot exploration, it proposes CTSAC: it embeds a Transformer into the SAC perception network to use history information and improve foresight, designs a periodic-review curriculum learning scheme to mitigate catastrophic forgetting and accelerate training, and optimizes LiDAR clustering to narrow the sim-to-real gap. ROS-Gazebo simulation and real-robot experiments show higher exploration success rate and efficiency than traditional non-learning methods and mainstream learning-based algorithms.

推荐指数:

DARE#

Arxiv ID 2410.16687
幻觉翻译 2410.16687

Diffusion policy + attention map encoder to generate exploration paths from optimal expert demonstrations

show8 It proposes DARE, a generative method for autonomous robot exploration. It uses an attention encoder to extract environment-map features, and a diffusion-policy network to learn exploration patterns from optimal expert demonstrations. It can infer the structure of unknown regions based on local environment beliefs and generate explicit long-horizon planned paths. Simulation and real-world deployment show exploration efficiency comparable to mainstream traditional and learning-based planners, with strong generalization and sim-to-real transfer.

推荐指数:
Paper Reading: Robot learning 2
https://xiaohei94.github.io/en/blog/paper-reading-2
Author 红鼻子小黑
Published at August 8, 2025
Comment seems to stuck. Try to refresh?✨