# RNN-Text-Gen

A small language model using a Recurrent Neural Network to generate text based on a corpus.

## Execution

To launch the code, run the following command:

```
cargo run --release <architecture> <filepath>
```

The architecture must be either `rnn` (for a vanilla Recurrent Neural Network) or `lstm` (for a Long Short-Term Memory network). The file at `<filepath>` is used as the training text for the network. A showcase file containing 40k lines of Shakespeare's plays is provided at `data/shakespeare-40k.txt`.
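For example, to train the LSTM on the provided Shakespeare corpus:

```
cargo run --release lstm data/shakespeare-40k.txt
```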

The implementation is written in Rust and uses the nalgebra library for linear algebra computations. The vanilla RNN implementation is based on this blog post by Andrej Karpathy, which I adapted to support LSTM.

Hyperparameters are defined in `src/rnn.rs`; you can, for instance, change the number of neurons in the hidden layer. A sketch of what such constants might look like is given below.
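For illustration only; the constant names and the hidden-layer size here are hypothetical, while the sequence length of 25 and the learning rate of 0.1 are the values stated elsewhere in this README:

```rust
// Hypothetical names; check src/rnn.rs for the actual constants.
pub const HIDDEN_SIZE: usize = 100;  // neurons in the hidden layer (assumed value)
pub const SEQ_LENGTH: usize = 25;    // training sequence length (see "Result sample")
pub const LEARNING_RATE: f64 = 0.1;  // eta used by AdaGrad (see "Optimization")
```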

## Theory

### Vanilla RNN

The vanilla RNN uses the following recurrence equations:

$$\begin{align*} h_t &= \tanh\left(W_{hh}h_{t-1}+W_{xh}x_t + b_h\right)\\ y_t &= W_{hy}h_t+b_y \end{align*}$$
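As a minimal nalgebra sketch, a single forward step could look like the following (the struct and field names are illustrative, not necessarily the ones in `src/rnn.rs`):

```rust
use nalgebra::{DMatrix, DVector};

// Illustrative parameter container for the vanilla RNN.
struct VanillaRnn {
    w_xh: DMatrix<f64>, // input-to-hidden weights
    w_hh: DMatrix<f64>, // hidden-to-hidden weights
    w_hy: DMatrix<f64>, // hidden-to-output weights
    b_h: DVector<f64>,  // hidden bias
    b_y: DVector<f64>,  // output bias
}

impl VanillaRnn {
    // h_t = tanh(W_hh h_{t-1} + W_xh x_t + b_h),  y_t = W_hy h_t + b_y
    fn step(&self, h_prev: &DVector<f64>, x_t: &DVector<f64>) -> (DVector<f64>, DVector<f64>) {
        let h_t = (&self.w_hh * h_prev + &self.w_xh * x_t + &self.b_h).map(f64::tanh);
        let y_t = &self.w_hy * &h_t + &self.b_y;
        (h_t, y_t)
    }
}
```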

### LSTM

The Long Short-Term Memory architecture uses the following recurrence equations:

$$\begin{align*} \begin{bmatrix} i\\ f\\ o\\ g \end{bmatrix} &= \begin{bmatrix} \sigma\\ \sigma\\ \sigma\\ \tanh \end{bmatrix} W \begin{bmatrix} h_{t-1}\\ x_t \end{bmatrix} \\ c_t &= f \odot c_{t-1} + i\odot g\\ h_t &= o \odot \tanh(c_t) \end{align*}$$

where $\odot$ denotes the Hadamard (component-wise) product, which corresponds to `.component_mul` in the Rust code.
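A single LSTM step, written directly from the equations above, might look like this sketch (the function signature and shape layout are assumptions; the equations omit a bias term, so the sketch does too):

```rust
use nalgebra::{DMatrix, DVector};

// One LSTM step. `w` is assumed to map the stacked [h_{t-1}; x_t] vector
// to the four stacked gate pre-activations, i.e. (4 * hidden) x (hidden + input).
fn lstm_step(
    w: &DMatrix<f64>,
    h_prev: &DVector<f64>,
    c_prev: &DVector<f64>,
    x_t: &DVector<f64>,
) -> (DVector<f64>, DVector<f64>) {
    let n = h_prev.len();
    // Stack h_{t-1} and x_t into one column vector.
    let mut z = DVector::zeros(n + x_t.len());
    z.rows_mut(0, n).copy_from(h_prev);
    z.rows_mut(n, x_t.len()).copy_from(x_t);

    let pre = w * &z; // stacked pre-activations [i; f; o; g]
    let sigmoid = |v: f64| 1.0 / (1.0 + (-v).exp());
    let i = pre.rows(0, n).map(sigmoid);
    let f = pre.rows(n, n).map(sigmoid);
    let o = pre.rows(2 * n, n).map(sigmoid);
    let g = pre.rows(3 * n, n).map(f64::tanh);

    // c_t = f . c_{t-1} + i . g,  h_t = o . tanh(c_t)
    let c_t = f.component_mul(c_prev) + i.component_mul(&g);
    let h_t = o.component_mul(&c_t.map(f64::tanh));
    (h_t, c_t)
}
```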

### Optimization (AdaGrad)

The optimization is done with AdaGrad. For each parameter $\theta_{t,i}$:

$$\begin{align*} m_{t+1,i} &= m_{t,i} + \nabla\mathcal{L}(\theta_t)_i^2\\ \theta_{t+1,i} &= \theta_{t,i} - \frac{\eta}{\sqrt{m_{t+1,i}+\varepsilon}}\cdot\nabla\mathcal{L}(\theta_t)_i \end{align*}$$

where $\varepsilon=10^{-8}$ and $\eta$ is the learning rate, $0.1$ in the code.
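Element-wise, the update is straightforward; a sketch for a single parameter vector (the actual code applies this per weight matrix, and the function name is hypothetical):

```rust
use nalgebra::DVector;

// One AdaGrad update: memory accumulates squared gradients,
// and each parameter gets a per-component adaptive step size.
fn adagrad_step(
    theta: &mut DVector<f64>,  // parameters
    memory: &mut DVector<f64>, // running sum of squared gradients, m
    grad: &DVector<f64>,       // gradient of the loss w.r.t. theta
    eta: f64,                  // learning rate, 0.1 in the code
) {
    const EPS: f64 = 1e-8;
    for i in 0..theta.len() {
        memory[i] += grad[i] * grad[i];
        theta[i] -= eta / (memory[i] + EPS).sqrt() * grad[i];
    }
}
```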

## Result sample

After ... iterations of the LSTM with sequences of length 25, I obtained the following sample on the `shakespeare-40k.txt` dataset:

```
QUEEN ELIZABETH:
The fraine thy most a twagenest.
Dle staffold!
To you doubs,
And weal drief.

PRESBERHA:
I ungo to cursess witor'd; us lave whil, than enough.

KING RICHARD III:
That lighterde for thy dascore defol,
And with leavend my sagn of the good burtief.
Thou wor he art I. Prince, but bournon they have my lord.
```
