INDEX
Explanations
repeated mentions of names
words ending in ock or ick
New Auto-Interp
Negative Logits
Vitale
-0.56
┑
-0.43
nmgp
-0.42
Lerner
-0.41
Lag
-0.40
Acqu
-0.40
Su
-0.40
解
-0.39
صوتيه
-0.39
.
-0.39
POSITIVE LOGITS
nock
2.98
rick
1.53
RICK
1.07
OCK
0.99
noch
0.93
dock
0.90
ricks
0.90
nok
0.88
ock
0.80
omock
0.79
Activations Density 0.002%