This Jupyter notebook implements a simulation of the multi-armed bandit problem using a stochastic approach. It provides a framework for creating and running bandit simulations, which can be useful for studying reinforcement learning algorithms and decision-making under uncertainty.
- Implements Bernoulli bandits with random probability distributions
- Simulates multi-armed bandit games with a configurable number of bandits and time steps
- Provides a function to run multiple simulations and average the results
- Includes example usage for both a single game and a full simulation
- Python 3.x
- Jupyter Notebook or JupyterLab
- NumPy
- Ensure you have Python 3.x installed on your system.
- Install Jupyter Notebook if you haven't already: `pip install jupyter`
- Install NumPy if you haven't already: `pip install numpy`
- Download the `Multi_Armed_Bandit_Simulation.ipynb` file to your local machine.
- Navigate to the directory containing the notebook in your terminal or command prompt.
- Start Jupyter Notebook: `jupyter notebook`
- In the Jupyter interface that opens in your web browser, click on `Multi_Armed_Bandit_Simulation.ipynb` to open it.
- You can run individual cells by selecting them and pressing Shift+Enter, or run all cells from the "Cell" menu by selecting "Run All".
To customize the simulation parameters, modify the values in the cells containing the example usage. For instance:
```python
# Run a simple game
game = BanditsGame(K=5, T=50)  # 5 bandits, 50 time steps
game.run_stochastic()

# Run the full simulation
stochastic_results = run_simulation(n_runs=20, runs_per_game=200, K=5, T=2000)
```
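In these calls, `K` is the number of bandits and `T` the number of time steps per game; `n_runs` and `runs_per_game` determine how many games `run_simulation` runs and averages over (their exact roles depend on the notebook's implementation).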
You can copy the relevant cells containing the classes and functions from this notebook to use in your own Jupyter notebooks or Python scripts.
The notebook is structured as follows:
- Introduction and imports
- `BernoulliBandit` class definition
- `BanditsGame` class definition
- `run_simulation` function definition (a sketch of these three components follows this list)
- Example of running a simple game
- Example of running a full simulation
- Results analysis and visualization (if applicable)
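For reference, here is a minimal sketch of what these components might look like. It is written to be consistent with the feature list and the example usage in this README, not taken from the notebook itself: the random-arm policy inside `run_stochastic` and the way `n_runs` and `runs_per_game` are combined are assumptions.

```python
import numpy as np

class BernoulliBandit:
    """A single arm whose reward is 1 with a fixed success probability, else 0."""
    def __init__(self, p=None):
        # Draw the success probability uniformly at random if none is given
        self.p = np.random.uniform() if p is None else p

    def pull(self):
        # Bernoulli reward: 1 with probability p, 0 otherwise
        return np.random.binomial(1, self.p)


class BanditsGame:
    """A single game: K Bernoulli arms played for T time steps."""
    def __init__(self, K, T):
        self.K = K
        self.T = T
        self.bandits = [BernoulliBandit() for _ in range(K)]

    def run_stochastic(self):
        # Stochastic policy: pick an arm uniformly at random at every step
        rewards = np.zeros(self.T)
        for t in range(self.T):
            k = np.random.randint(self.K)
            rewards[t] = self.bandits[k].pull()
        return rewards


def run_simulation(n_runs, runs_per_game, K, T):
    # Average reward trajectories over many independent games
    # (n_runs * runs_per_game games in total -- an assumption about the parameters)
    n_games = n_runs * runs_per_game
    total = np.zeros(T)
    for _ in range(n_games):
        total += BanditsGame(K=K, T=T).run_stochastic()
    return total / n_games
```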
You can extend this simulation by:
- Implementing different types of bandits (e.g., Gaussian bandits)
- Adding new bandit selection strategies (e.g., epsilon-greedy, UCB; see the sketch after this list)
- Implementing visualization of the results
- Adding more complex reward structures
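For example, an epsilon-greedy strategy could be added roughly as follows. This sketch assumes the `BernoulliBandit` interface from the sketch above (a `pull()` method returning 0 or 1); the function name `run_epsilon_greedy` and the `eps` parameter are illustrative, not part of the notebook.

```python
import numpy as np

def run_epsilon_greedy(bandits, T, eps=0.1):
    """Explore a random arm with probability eps; otherwise exploit the arm
    with the highest estimated mean reward so far."""
    K = len(bandits)
    counts = np.zeros(K)      # number of pulls per arm
    estimates = np.zeros(K)   # running mean reward per arm
    rewards = np.zeros(T)

    for t in range(T):
        if np.random.uniform() < eps:
            k = np.random.randint(K)       # explore a random arm
        else:
            k = int(np.argmax(estimates))  # exploit the best arm so far
        reward = bandits[k].pull()
        counts[k] += 1
        # Incremental update of the running mean for arm k
        estimates[k] += (reward - estimates[k]) / counts[k]
        rewards[t] = reward

    return rewards
```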
Feel free to fork this project, submit pull requests, or suggest improvements by opening an issue in the project repository.
This project is open-source and available under the MIT License.