INDEX
Explanations
expressions of approval or agreement
short affirmative responses
New Auto-Interp
Negative Logits
ViewImports
-0.30
السكان
-0.23
typelib
-0.23
/\.
-0.23
spese
-0.23
Vidite
-0.22
boste
-0.21
共
-0.21
Stages
-0.21
εφ
-0.21
POSITIVE LOGITS
nakalista
0.72
IVEREF
0.71
TestingModule
0.69
الحياه
0.66
lanatory
0.65
فريبيس
0.64
yyb
0.63
Haha
0.63
Hahaha
0.59
autorytatywna
0.57
Activations Density 0.198%