INDEX
Explanations
the word "never" followed by a verb indicating a negative action or statement
repetitive phrases starting with "Never."
Negative Logits
Token       Logit
nikov       -0.69
redients    -0.68
ahime       -0.64
?????-      -0.63
eers        -0.59
emia        -0.58
iency       -0.58
otle        -0.57
atics       -0.57
AAA         -0.57
Positive Logits
Token           Logit
theless         1.77
mind            1.14
winter          1.04
ceases          0.91
mind            0.87
hesitate        0.84
underestimate   0.80
EVER            0.78
again           0.75
forgotten       0.74
Activations Density: 0.042%