INDEX
Explanations
names of individuals or locations with high relevance
proper nouns or names of individuals
New Auto-Interp
Negative Logits
envy
-0.90
FTWARE
-0.73
ambassadors
-0.69
ModLoader
-0.68
constants
-0.64
sophistic
-0.63
Reviewer
-0.62
CONT
-0.60
shorthand
-0.59
merce
-0.59
POSITIVE LOGITS
eman
0.94
uer
0.90
chuk
0.90
enberg
0.90
orst
0.89
stad
0.89
Jr
0.89
cker
0.89
kamp
0.88
linger
0.88
Activations Density 0.413%