INDEX
Explanations
words related to physical locations or persons
occurrences of the substring "ond"
New Auto-Interp
Negative Logits
mson
-0.85
kson
-0.65
======
-0.64
jriwal
-0.62
CLE
-0.61
GBT
-0.61
plates
-0.59
glers
-0.59
HCR
-0.57
IPS
-0.57
POSITIVE LOGITS
ragon
1.26
rive
1.02
erer
1.01
ering
0.99
orf
0.98
ition
0.96
romeda
0.93
roit
0.89
itional
0.89
imensional
0.88
Activations Density 0.028%