INDEX
Explanations
proper nouns or names
proper nouns related to organizations, places, and brands
New Auto-Interp
Negative Logits
etheless
-0.61
separatist
-0.60
depreciation
-0.55
inherit
-0.54
plag
-0.54
dracon
-0.53
Rebels
-0.53
retaliate
-0.53
polarized
-0.53
sort
-0.53
POSITIVE LOGITS
ona
1.00
oya
0.88
isha
0.83
inda
0.81
ley
0.80
ado
0.79
leys
0.79
onda
0.79
aci
0.77
ena
0.77
Activations Density 0.457%