INDEX
Explanations
phrases related to skepticism and criticism in the context of media and public perception
New Auto-Interp
Negative Logits
to
-0.15
UpInside
-0.15
almost
-0.14
luž
-0.14
алеж
-0.13
øns
-0.13
redo
-0.13
oin
-0.13
oplevel
-0.13
componentDid
-0.13
POSITIVE LOGITS
here
0.24
TOO
0.20
about
0.18
здеÑģÑĮ
0.18
Here
0.17
aquÃŃ
0.17
about
0.17
either
0.17
either
0.17
HERE
0.17
Activations Density 0.033%