INDEX
Explanations
proper names, particularly those of individuals
New Auto-Interp
Negative Logits
ritz
-0.16
obar
-0.16
itet
-0.15
autoload
-0.15
alog
-0.15
adol
-0.14
kah
-0.14
slash
-0.14
autoload
-0.14
orate
-0.14
POSITIVE LOGITS
Grande
0.35
Ari
0.30
Manchester
0.23
Manchester
0.20
Pete
0.20
Trafford
0.20
Davidson
0.19
Vict
0.19
Dangerous
0.19
grande
0.18
Activations Density 0.001%