INDEX
Explanations
proper nouns, especially names and locations, in the text
New Auto-Interp
Negative Logits
203
-0.15
fi
-0.15
ohn
-0.15
rieve
-0.14
è¨
-0.14
pil
-0.14
ilon
-0.14
102
-0.14
eni
-0.13
disp
-0.13
POSITIVE LOGITS
odos
0.18
ssql
0.15
#line
0.15
ãģ£ãģį
0.14
pie
0.14
AUD
0.14
_lite
0.14
éľ²
0.14
ados
0.14
ška
0.14
Activations Density 0.033%