Neural RRT*
Article Goal
Explain the Neural RRT* (Neural Rapidly-exploring Random Tree Star) algorithm for learning-based optimal path planning.
What is Neural RRT*?
Neural RRT* is a learning-based optimal path planning algorithm that combines a Convolutional Neural Network (CNN) with the traditional RRT* algorithm to converge to optimal solutions faster. It addresses two key limitations of traditional RRT*, sensitivity to the initial solution and slow convergence, by using a pre-trained CNN to predict a probability distribution over likely optimal paths, enabling intelligent non-uniform sampling instead of purely random exploration.
The algorithm replaces RRT*’s uniform random sampling with a learned sampling distribution that biases exploration toward regions more likely to contain optimal paths. This neural guidance significantly reduces the time and computational resources needed to find high-quality solutions.
Neural RRT*: Written Walkthrough
Core Concepts
- Learned Sampling Distribution - Uses a CNN to predict probability maps that guide where to sample next, replacing uniform random sampling.
- Non-Uniform Intelligent Sampling - Samples are drawn from areas with higher probability of containing optimal paths rather than uniformly across the space.
- CNN-Based Path Prediction - A pre-trained neural network analyzes the environment and predicts likely regions for optimal paths.
- Hybrid Approach - Combines the asymptotic optimality guarantees of RRT* with the efficiency of learned heuristics.
The Math
Traditional RRT* Sampling
In standard RRT*, samples are drawn uniformly: \(x_{\text{rand}} \sim \text{Uniform}(\mathcal{X}_{\text{free}})\)
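As a baseline for comparison, this uniform sampling can be sketched on a 2D occupancy grid with simple rejection sampling (the grid representation and helper name are assumptions for illustration, not the paper's implementation):

```python
import numpy as np

def sample_uniform_free(occupancy, rng):
    """Rejection-sample a cell uniformly from the free space of a 2D
    occupancy grid (True = obstacle). Illustrative baseline sampler."""
    h, w = occupancy.shape
    while True:
        x, y = int(rng.integers(0, h)), int(rng.integers(0, w))
        if not occupancy[x, y]:  # keep only collision-free samples
            return x, y
```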
Neural RRT* Sampling Distribution
Neural RRT* uses a learned probability distribution: \(x_{\text{rand}} \sim P_{\theta}(x | \text{map}, x_{\text{start}}, x_{\text{goal}})\)
Where \(P_{\theta}\) is the CNN-predicted probability distribution parameterized by \(\theta\).
CNN Architecture
The neural network takes as input: \(\text{Input} = [\text{Map}, \text{Start}, \text{Goal}]\)
And outputs a probability heatmap: \(\text{Output} = P_{\theta}(x, y | \text{Input}) \in [0,1]^{H \times W}\)
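One common way to encode this input is to stack the map with one-hot start and goal masks into a multi-channel image; the exact channel layout here is an assumption, not the paper's specific encoding:

```python
import numpy as np

def build_cnn_input(grid, start, goal):
    """Stack the three inputs named above (map, start, goal) into a
    3-channel image tensor. Channel layout is an illustrative choice."""
    h, w = grid.shape
    start_ch = np.zeros((h, w), np.float32)
    start_ch[start] = 1.0  # one-hot start position
    goal_ch = np.zeros((h, w), np.float32)
    goal_ch[goal] = 1.0    # one-hot goal position
    return np.stack([grid.astype(np.float32), start_ch, goal_ch])
```

A CNN consuming this tensor would then emit an \(H \times W\) heatmap of per-cell probabilities.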
Probability Normalization
The output probability map is normalized to ensure valid sampling: \(P_{\text{norm}}(x,y) = \frac{P_{\theta}(x,y)}{\sum_{i,j} P_{\theta}(i,j)}\)
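A minimal sketch of this normalization, plus drawing a sample from the resulting distribution (function names are illustrative):

```python
import numpy as np

def normalize_heatmap(p):
    """Normalize a nonnegative heatmap so its entries sum to 1,
    matching the formula above."""
    p = np.clip(p, 0.0, None)  # guard against small negative outputs
    return p / p.sum()

def sample_from_heatmap(p_norm, rng):
    """Draw one (row, col) cell with probability proportional to the
    normalized heatmap."""
    h, w = p_norm.shape
    idx = int(rng.choice(h * w, p=p_norm.ravel()))
    return divmod(idx, w)
```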
Training Loss Function
The CNN is typically trained using successful paths from A* or other optimal planners: \(\mathcal{L} = -\sum_{i=1}^{N} \sum_{(x,y) \in \text{path}_i} \log P_{\theta}(x,y | \text{map}_i, s_i, g_i)\)
Where \(N\) is the number of training examples and \(\text{path}_i\) represents ground truth optimal paths.
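For a single training example, the inner sum of this loss can be sketched directly (a real training loop would backpropagate through the network and average over the \(N\) examples; the epsilon floor is an implementation detail I've added):

```python
import numpy as np

def path_nll(p_norm, path):
    """Negative log-likelihood of one ground-truth path under the
    predicted heatmap: the inner sum of the loss formula above."""
    eps = 1e-12  # numerical floor to avoid log(0)
    return -sum(np.log(p_norm[x, y] + eps) for (x, y) in path)
```

The loss is minimized when the network concentrates probability mass on the demonstrated path cells.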
Hybrid Sampling Strategy
Neural RRT* often uses a mixture of learned and uniform sampling: \(x_{\text{rand}} \sim \begin{cases} P_{\theta}(\cdot \mid \text{map}, x_{\text{start}}, x_{\text{goal}}) & \text{with probability } \alpha \\ \text{Uniform}(\mathcal{X}_{\text{free}}) & \text{with probability } 1-\alpha \end{cases}\)
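This mixture can be sketched as follows, assuming a grid map and a pre-normalized heatmap (names and structure are illustrative):

```python
import numpy as np

def hybrid_sample(p_norm, occupancy, alpha, rng):
    """With probability alpha, draw from the learned heatmap;
    otherwise rejection-sample uniformly from free space."""
    h, w = occupancy.shape
    if rng.random() < alpha:  # learned, non-uniform branch
        idx = int(rng.choice(h * w, p=p_norm.ravel()))
        return divmod(idx, w)
    while True:  # uniform branch keeps support over all free space
        x, y = int(rng.integers(0, h)), int(rng.integers(0, w))
        if not occupancy[x, y]:
            return x, y
```

In practice, obstacle cells are typically masked out of the heatmap before normalization so the learned branch also avoids collisions.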
Algorithm Steps
- Pre-Training Phase - Train CNN on dataset of optimal paths generated by A* or other complete planners
- Initialize Tree - Start RRT* tree with initial configuration
- Neural Prediction - Use trained CNN to generate probability heatmap for current environment
- Intelligent Sampling - Sample new configurations from learned probability distribution
- Traditional RRT* Operations - Perform standard nearest neighbor, steer, and collision check operations
- Tree Extension - Add collision-free nodes to tree with rewiring for optimality
- Adaptive Sampling - Optionally update probability distribution as tree grows
- Convergence - Repeat until goal reached or computational budget exhausted
Advantages of Neural RRT*
- Faster Convergence - Reaches near-optimal solutions significantly faster than traditional RRT*
- Reduced Sample Complexity - Requires fewer random samples due to intelligent guidance
- Environment Generalization - A pre-trained model can transfer to new environments similar to its training distribution
- Maintains Optimality - The uniform component of the hybrid sampler keeps nonzero probability over all of \(\mathcal{X}_{\text{free}}\), so RRT*'s asymptotic optimality guarantees are preserved
- Parallel Training - CNN training can be done offline and reused across problems
- Reduced Computational Waste - Avoids sampling in obviously suboptimal regions
Limitations
- Training Data Dependency - Requires large datasets of optimal solutions for effective training
- Environment Specificity - CNN may not generalize well to environments significantly different from training data
- Discretization Effects - Performance depends on resolution of probability heatmaps
- Training Overhead - Initial CNN training requires significant computational resources
- Memory Requirements - Must store neural network parameters and probability maps
