Explanations
repeated usage of the word "going."
Negative Logits
fubject           -0.77
חיצוניים          -0.75
ekš               -0.72
themſelves        -0.71
BagConstraints    -0.70
volezza           -0.67
anhänger          -0.67
herren            -0.65
máscara           -0.65
Montaigne         -0.65
Positive Logits
going             1.90
Going             1.77
GOING             1.71
going             1.67
Going             1.67
GOING             1.47
goin              1.35
goin              1.11
Gonna             1.00
coming            0.98
Activations Density: 0.042%