INDEX
Explanations
words or phrases that express strong negative feelings or accusations towards political opponents and the media
New Auto-Interp
Negative Logits
ValueStyle
-0.81
:]:
-0.78
出版年
-0.78
ustimmung
-0.74
gynhyrchwyd
-0.73
richTextPanel
-0.73
$")
-0.72
"])
-0.71
]$}
-0.71
"})
-0.69
POSITIVE LOGITS
begleiten
0.44
L
0.44
Lexikon
0.43
umb
0.41
asas
0.41
persevere
0.40
I
0.39
D
0.38
now
0.38
насе
0.38
Activations Density 0.126%