INDEX
Explanations
instances of the word "town."
New Auto-Interp
Negative Logits
coni
-0.19
ylland
-0.18
orgia
-0.17
ulers
-0.17
yonel
-0.17
oples
-0.16
orsch
-0.15
jÃŃm
-0.14
riday
-0.14
uten
-0.14
POSITIVE LOGITS
bridge
0.18
-wide
0.17
ships
0.17
ypass
0.15
wide
0.15
sville
0.15
lif
0.15
/state
0.15
spo
0.14
ore
0.14
Activations Density 0.025%