Explanations
phrases indicating contradiction or opposing viewpoints
Negative Logits
Token            Logit
+#+              -0.51
المشاركات        -0.46
numerusform      -0.45
Himo             -0.42
ってもら         -0.42
ять              -0.42
Awak             -0.39
šiť              -0.39
OFDb             -0.39
turun            -0.39
Positive Logits
Token            Logit
myth             0.74
rumors           0.71
myths            0.70
feared           0.68
initComponents   0.68
misleading       0.67
Rumors           0.65
claims           0.65
rumours          0.64
fears            0.64
Activations Density: 0.491%