INDEX
Explanations
instances of strong verbs and adjectives indicating actions or conditions
New Auto-Interp
Negative Logits
.weixin
-0.20
ensch
-0.15
ãĥ¼ãĥĬ
-0.15
'gc
-0.14
ifo
-0.14
ocities
-0.14
ctal
-0.14
unfinished
-0.14
uncture
-0.14
fikir
-0.14
POSITIVE LOGITS
ilon
0.17
panel
0.16
meli
0.15
atta
0.15
Panel
0.15
_CODEC
0.14
dem
0.14
ault
0.14
ario
0.14
Gal
0.14
Activations Density 0.045%