INDEX
Explanations
proper nouns or names in text
specific names and titles related to people, places, and organizations
New Auto-Interp
Negative Logits
astical
-0.74
Gaul
-0.71
quartered
-0.67
rious
-0.66
ullah
-0.65
nikov
-0.64
pard
-0.61
åĬ
-0.61
ophob
-0.61
tics
-0.60
POSITIVE LOGITS
tracks
0.72
ITION
0.72
debit
0.64
ĨĴ
0.63
wagon
0.62
ERT
0.61
sor
0.60
metics
0.59
cert
0.59
DEV
0.59
Activations Density 0.360%