INDEX
Explanations
pronouns and names of individuals
New Auto-Interp
Negative Logits
Seym
-0.79
pregn
-0.60
Leban
-0.60
classic
-0.59
etheless
-0.59
Composite
-0.56
itaire
-0.56
igmatic
-0.56
+/-
-0.54
Operation
-0.54
POSITIVE LOGITS
zbollah
1.20
resy
0.95
Majesty
0.90
ading
0.83
gemony
0.83
'll
0.83
mos
0.82
eded
0.81
eding
0.81
brew
0.81
Activations Density 0.206%