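Candidate Hidden State $\tilde{h}_t$ - the vector that could replace $h_{t-1}$ and become the next memory vector. It is computed from the current input and the previous hidden state scaled by the reset gate $r_t$,
where $r_t = 0$ means the previous hidden state $h_{t-1}$ will be entirely discarded, and $r_t = 1$ means it is entirely kept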
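New Hidden State $h_t$ - will replace $h_{t-1}$ by combining a proportion of $h_{t-1}$ and the candidate hidden vector $\tilde{h}_t$, weighted by the update gate $z_t$: $h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t$
where $\odot$ is the element-wise multiplication, $z_t = 1$ means the new hidden state is entirely based on the candidate hidden vector $\tilde{h}_t$, and $z_t = 0$ means it is entirely based on the previous hidden state $h_{t-1}$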
Predicted output - defined as a simple linear operation on the new hidden state with a weight matrix
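
The three definitions above can be sketched in NumPy as follows. This is a minimal illustration and not a reference implementation: the function names, the weight names `W_h`, `U_h` and `W_y`, the use of `tanh` for the candidate state, and the omission of bias terms are all assumptions made here. The gate values `r` and `z` are passed in directly because how they are computed is not covered in this section.

```python
import numpy as np


def candidate_state(x, h_prev, r, W_h, U_h):
    """Candidate hidden state; the reset gate r scales h_prev element-wise.

    W_h and U_h are hypothetical weight names; tanh and no bias are assumed.
    """
    # r = 0 discards the previous hidden state, r = 1 keeps it in full.
    return np.tanh(W_h @ x + U_h @ (r * h_prev))


def new_hidden_state(h_prev, h_cand, z):
    """Blend the previous and candidate states with the update gate z."""
    # z = 1 -> candidate only, z = 0 -> previous hidden state only.
    return (1.0 - z) * h_prev + z * h_cand


def predict_output(h_new, W_y):
    """Predicted output as a linear map of the new hidden state (W_y is assumed)."""
    return W_y @ h_new


# Tiny example with made-up sizes: 3 inputs, 4 hidden units, 2 outputs.
rng = np.random.default_rng(0)
W_h = rng.standard_normal((4, 3))
U_h = rng.standard_normal((4, 4))
W_y = rng.standard_normal((2, 4))
x = rng.standard_normal(3)
h_prev = rng.standard_normal(4)

# Gate values fixed by hand for illustration; a real cell would learn them.
r = np.full(4, 0.5)
z = np.full(4, 0.5)

h_cand = candidate_state(x, h_prev, r, W_h, U_h)
h_new = new_hidden_state(h_prev, h_cand, z)
y_hat = predict_output(h_new, W_y)
```

Setting `z` to all ones or all zeros in `new_hidden_state` reproduces the two extreme cases described above: the result equals `h_cand` or `h_prev` respectively. Some libraries use the opposite convention for the update gate, so it is worth checking which one a given implementation follows.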