The project is a pytorch implementation of lip net which is based on a Convlution LSTM model for lip reading. The dataset used in the project consist of 1000 vidoes uttering random words e.g. "place white at e seven again".
The model was trained with the following features
sequence_length = 35
batch_size = 16
bidirectional = True
learning_rate = 0.001
