Question 4

How do LSTM networks maintain a constant error flow through time, and why is this important in addressing the vanishing gradient problem?