INDEX
Explanations
the word "forget" or its variations
instances of the words "forget," "forgot," and their variations
New Auto-Interp
Negative Logits
amen
-0.67
berus
-0.66
coefficients
-0.66
orough
-0.65
Ec
-0.65
inals
-0.64
XY
-0.64
elled
-0.62
é¾
-0.62
tained
-0.61
POSITIVE LOGITS
fulness
1.16
fully
1.06
ful
1.03
ingly
0.78
lore
0.78
forgetting
0.77
forgot
0.77
noon
0.77
ening
0.75
remember
0.74
Activations Density 0.022%