INDEX
Explanations
verbs and pronouns that indicate current states or ongoing actions
New Auto-Interp
Negative Logits
ga
-0.47
:
-0.46
frame
-0.44
ppi
-0.44
bootstrapcdn
-0.43
Turquía
-0.42
ant
-0.41
TestBase
-0.40
ss
-0.40
because
-0.39
POSITIVE LOGITS
Roskov
0.92
&___
0.85
nahilalakip
0.83
<?
0.77
featureID
0.77
oa̍t
0.74
__":
0.73
!*\
0.73
abestanden
0.72
pinulongan
0.71
Activations Density 0.134%