INDEX
Explanations
proper nouns or specific named entities
references to specific geographic locations or terms related to geography
New Auto-Interp
Negative Logits
oku
-0.66
hands
-0.60
Azerbaijan
-0.60
onym
-0.60
avoid
-0.59
sylv
-0.59
fig
-0.58
tsky
-0.58
guide
-0.58
stub
-0.58
POSITIVE LOGITS
enhagen
0.82
owship
0.77
andise
0.76
ioxide
0.74
ruary
0.70
imately
0.69
elli
0.68
iously
0.67
coni
0.66
ulin
0.66
Activations Density 0.420%