INDEX
Explanations
names with "Wid" or "Tud", possibly last names
proper nouns, particularly names of individuals and locations
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.76
retaliate
-0.68
Leban
-0.68
compr
-0.64
Collider
-0.62
IZE
-0.62
pmwiki
-0.60
residual
-0.60
tune
-0.60
ãĥĺ
-0.59
POSITIVE LOGITS
gets
1.18
ows
1.06
owed
0.94
erer
0.90
artz
0.82
nesday
0.81
Wid
0.80
ening
0.79
NT
0.79
erers
0.77
Activations Density 0.028%