INDEX
Explanations
phrases indicating group involvement or collective identity
New Auto-Interp
Negative Logits
+#+#
-0.99
utafitiHapana
-0.84
فريبيس
-0.81
الرياضيه
-0.76
Externé
-0.74
виправивши
-0.72
tvguidetime
-0.71
transfieras
-0.69
bootstrapcdn
-0.69
елның
-0.68
POSITIVE LOGITS
believe
0.71
believe
0.63
anticipate
0.61
constaté
0.61
believes
0.61
expect
0.60
recognize
0.59
reconhe
0.59
reconoc
0.58
commend
0.58
Activations Density 0.180%