INDEX
Explanations
instances of contractions or possessives in the text
Negative Logits
afil      -0.18
å£°éŁ³    -0.14
ëı        -0.14
mv        -0.14
ãĤĩ       -0.14
mx        -0.14
ormsg     -0.14
SWG       -0.14
人åĵ¡     -0.13
redient   -0.13
Positive Logits
if            0.16
LOPT          0.15
art           0.15
ataka         0.15
werk          0.14
most          0.14
.navigation   0.14
rib           0.14
alling        0.14
unkt          0.14
Activations Density 0.033%