INDEX
Explanations
names of people or places related to various countries
proper nouns, particularly names and places
Negative Logits
Token               Logit
ruary               -0.86
sidx                -0.67
kefeller            -0.66
inki                -0.65
ogle                -0.65
espie               -0.60
CTR                 -0.59
shove               -0.58
eret                -0.58
ournals             -0.58
Positive Logits
Token               Logit
oshenko             0.81
vic                 0.71
Ħ¢                  0.68
ova                 0.64
guiActiveUnfocused  0.61
kaya                0.61
Graphics            0.61
Prix                0.61
Duchess             0.59
tiny                0.59
Activations Density 1.272%
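
The page does not say how the density figure is computed. A common reading is that it is the fraction of sampled token positions on which the feature's activation is non-zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 13 of which are active.
toy_activations = [0.0] * 987 + [0.5] * 13
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 1.272% would mean the feature fires on roughly 13 of every 1000 sampled tokens.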