INDEX
Explanations
evaluative statements and assertions regarding claims and beliefs
Negative Logits
estring       -0.17
elib          -0.15
onation       -0.15
лава          -0.15
)((((         -0.15
сь            -0.14
onet          -0.14
earn          -0.14
alli          -0.14
ISCO          -0.14
Positive Logits
said          0.23
say           0.19
कहन           0.17
saying        0.17
said          0.17
description   0.17
credit        0.17
described     0.16
.say          0.16
regard        0.16
Activations Density 0.170%