You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository contains implementations of various reinforcement learning algorithms (DQN, DDQN TD3, PPO) for solving the Lunar Lander-v2 and BipedalWalker-v3 environment from OpenAI's Gym.
Algorithms Implemented
DQN (Deep Q-Network)
DDQN (Double Deep Q-Network)
TD3 (Twin Delayed Deep Deterministic Policy Gradient)
PPO (Proximal Policy Optimization)
Getting Started
1. Clone the Repository
git clone https://github.com/Viswesh-N/RL_BipedWalker.git
cd RL_BipedWalker
2. Set Up the Environment
It’s recommended to use a virtual environment for managing dependencies.
python -m venv env
source env/bin/activate # On Windows use `env\Scripts\activate`
3. Install Dependencies
Before running any commands, ensure you install all required Python packages.
pip install -r requirements.txt
4. Set Up the Python Path
To ensure that imports aren't broken, run the following command in the root directory of the repository:
export PYTHONPATH=$(pwd)
5. Running the Algorithms
DQN
Train the DQN or DDQN agent on the LunarLander-v2 environment:
Training logs are stored in the data/ directory, organized by algorithm name, environment, and timestamp. You can visualize the training progress using TensorBoard:
You can view the plots for DQN and TD3 training in the images/ directory.
DQN Plot:
TD3 Plot:
8. Customization
You can modify the configurations in the experiments/ folder to change hyperparameters, learning rates, or other settings specific to each algorithm.
9. Known Issues
LiDAR Raycasting Error: Sometimes, during training, you might encounter a r.LengthSquared() > 0.0f error due to numerical instability in the physics engine. The environment automatically resets when this occurs.