Explanations
words related to proper names or titles
references to value or worth in various contexts
Negative Logits
Token         Logit
unaff         -0.69
traffickers   -0.68
vana          -0.64
chronically   -0.60
meric         -0.60
unequ         -0.60
GO            -0.60
polar         -0.60
olar          -0.60
cyt           -0.60
Positive Logits
Token         Logit
shire         1.01
姫            0.96
sburg         0.95
bridge        0.92
Borough       0.92
Abbey         0.91
tons          0.89
Heath         0.87
hire          0.85
shed          0.84
Activations Density 0.117%