Skip to content

cwei-bgl/drlnd_p1_navigation

 
 

Repository files navigation

Project 1: Navigation

Introduction

For this project, you will train an agent to navigate (and collect bananas!) in a large, square world.

Trained Agent

Environment Description

States

  • The state space has 37 dimensions.
  • Contains the agent's velocity.
  • As well as ray-based perception of objects around agent's forward direction.

Action

  • 0 - move forward.
  • 1 - move backward.
  • 2 - turn left.
  • 3 - turn right.

Rewards:

  • +1: collecting a yellow banana.
  • -1: for collecting a blue banana.

Goal:

  • To collect as many yellow bananas as possible while avoiding blue bananas.
  • It is considered solved if the agent get an average score of +13 over 100 consecutive episodes.

Project Structure

The repository contains the following files:

  • network.py Contains simple deep neural network.
  • dueling_network.py Contains a network implements Dueling Network from the paper
  • dqn_agent.py Contains Q-Network agent.
  • ddqn_agent.py Contains double Q-Network agent.
  • ddqn_prioritized_agent.py Contains double Q-Network agent with prioritized experience replay.
  • prioritized_replay_buffer.py Contains prioritized experience replay buffer implementation.
  • sum_tree.py Contains a more efficient priority-based sampling structure, the implementation of which references the one from Jaromir's blog post.
  • Navigation.ipynb Contains the agent training code for Unity Banana environment.
  • Report.md Contains the description of the implementation details.

Getting Started

  1. Install Anaconda(https://conda.io/docs/user-guide/install/index.html)
  2. Install dependencies by issue:
pip install -r requirements.txt
  1. Download the environment from one of the links below. You need only select the environment that matches your operating system:

  2. Place the file in the root folder, and unzip (or decompress) the file.

Instructions

Follow the steps in Navigation.ipynb to get started with training.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 86.7%
  • Python 13.3%