INDEX
Explanations
expressions and mentions of opinions
New Auto-Interp
Negative Logits
бла
-0.78
■■
-0.70
delaire
-0.68
ьаж
-0.67
/***/
-0.63
Pvt
-0.63
endish
-0.63
برز
-0.63
Byzantium
-0.61
lojik
-0.61
POSITIVE LOGITS
Opinion
1.47
opinion
1.46
opinions
1.41
Opinions
1.35
opinion
1.32
Opin
1.30
Opinion
1.19
Opinions
1.16
opinions
1.16
OPINION
1.15
Activations Density 0.091%