INDEX
Explanations
proper nouns, specifically names of people and locations
proper nouns, specifically names of individuals
Negative Logits
nces                -1.02
fights              -0.73
amaz                -0.70
mate                -0.70
================    -0.70
ndra                -0.69
Ranked              -0.69
Ü                   -0.68
mine                -0.66
mathemat            -0.66
Positive Logits
ZI                  0.90
Neill               0.78
Lennon              0.76
elson               0.75
Wick                0.75
steen               0.73
asonic              0.73
oyd                 0.73
otiation            0.72
Advertisement       0.72
Activations Density: 0.015%