INDEX
Explanations
Negations and words indicating disagreement or denial
"No" followed by words indicating doubt or uncertainty
"No" followed by clarification or negation
Negative Logits
متعلقه    -0.70
számára    -0.68
ktır    -0.59
trás    -0.56
textStatus    -0.56
Roskov    -0.56
колко    -0.56
незавершена    -0.55
refusé    -0.55
Bioaccumulative    -0.53
Positive Logits
matter    1.55
wonder    1.05
doubt    1.00
matter    0.97
MATTER    0.92
worries    0.89
Matter    0.89
offense    0.89
Matter    0.88
longer    0.87
Activations Density: 0.092%
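The page does not say how the density figure is computed. As a rough illustration only, the sketch below assumes it is the percentage of sampled token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions on which
    the feature fires at all. The source page does not confirm this.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical data: 100,000 token positions, 92 of which activate the feature.
sample = [0.0] * 100_000
for i in range(92):
    sample[i] = 1.0

density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.092% would mean the feature fires on roughly 92 of every 100,000 token positions, which fits a feature tied to a narrow pattern such as "no" followed by "matter", "wonder", or "doubt".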