INDEX
Explanations
criticisms or arguments addressed towards different beliefs, policies, or individuals
phrases that express misconceptions or criticisms related to effort and performance
New Auto-Interp
Negative Logits
»Ĵ
-0.68
anooga
-0.56
)."
-0.55
'';
-0.53
idav
-0.53
gencies
-0.53
EStreamFrame
-0.51
cture
-0.50
}.
-0.49
nesota
-0.49
POSITIVE LOGITS
"â̦
0.65
Doomsday
0.56
UFOs
0.56
"[
0.55
pedoph
0.55
actually
0.54
Paddock
0.53
Canaver
0.53
actually
0.52
Blumenthal
0.51
Activations Density 2.038%