TRIME
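Training approach for memory-augmented language models (TRIME = Training with In-batch Memories)
during training, other tokens in the same batch serve as memories, so the model learns to use memories it will see at test time
handles three types of memories: local (earlier tokens in the same input), long-term (earlier segments of the same document), external (a datastore built from a corpus)
essentially extends the final softmax: besides the output embedding matrix, the model also scores the current hidden state against a matrix of memory representations, and a memory's score is added to the probability of the token that followed it; this is how long-range context enters the prediction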
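The "extra matrix" idea above can be sketched in a few lines of NumPy. This is only an illustration under simplifying assumptions, not the paper's implementation: the function name `memory_augmented_probs` and all argument names are made up here, and a plain dot product with the same hidden state is used for both the vocabulary and the memory scores.

```python
import numpy as np

def memory_augmented_probs(h, E, mem_keys, mem_tokens):
    """Next-token distribution from output embeddings plus memories.

    h          : (d,)   hidden state of the current context
    E          : (V, d) output embedding matrix
    mem_keys   : (M, d) hidden states of memory contexts (assumed given)
    mem_tokens : (M,)   int token id that followed each memory context
    """
    V = E.shape[0]
    # Stack the memory matrix under the embedding matrix and score both at once.
    scores = np.concatenate([E @ h, mem_keys @ h])
    scores = scores - scores.max()          # numerical stability
    weights = np.exp(scores)
    # Start from the ordinary vocabulary terms ...
    totals = weights[:V].copy()
    # ... then add each memory's weight to the token it was followed by.
    np.add.at(totals, mem_tokens, weights[V:])
    return totals / totals.sum()

# Toy usage: 3-token vocabulary, one memory that was followed by token 2.
h = np.array([1.0, 0.0])
E = np.zeros((3, 2))                        # uniform without memories
mem_keys = np.array([[2.0, 0.0]])
mem_tokens = np.array([2])
print(memory_augmented_probs(h, E, mem_keys, mem_tokens))
```

With an empty memory the function reduces to the usual softmax over the vocabulary; a memory whose context resembles the current one shifts probability toward the token that followed it.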
Binding Language Models in Symbolic Languages