INDEX
Explanations
instances where something is being forgotten or ignored
instances of the word "forget" and its variations
New Auto-Interp
Negative Logits
XY
-0.70
amen
-0.69
tained
-0.69
Parser
-0.69
nomine
-0.68
coefficients
-0.68
arte
-0.68
inals
-0.67
uana
-0.67
ullivan
-0.65
POSITIVE LOGITS
fulness
1.15
fully
0.99
ful
0.98
theless
0.91
forgetting
0.84
forgot
0.83
forget
0.83
remember
0.79
remembered
0.78
ingly
0.77
Activations Density 0.020%