INDEX
Explanations
words indicating intensity or frequency of actions or states
New Auto-Interp
Negative Logits
"]').
-0.77
="{{-0.76
UnusedPrivate
-0.75
']").
-0.74
")
-0.70
'=>$
-0.70
']);
-0.70
"){
-0.69
?>/
-0.68
}`).
-0.68
POSITIVE LOGITS
>=",
0.67
Demografie
0.64
zaidi
0.59
enough
0.58
tawesome
0.56
pení
0.55
خارجية
0.53
gawai
0.53
again
0.53
eclampsia
0.52
Activations Density 0.497%