INDEX
Explanations
instances of discussions or controversies related to politics and policies
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.80
alo
-0.72
ãĤ¦ãĤ¹
-0.68
alogy
-0.64
afterlife
-0.63
opia
-0.63
vre
-0.63
prev
-0.61
Widget
-0.61
Probably
-0.60
POSITIVE LOGITS
however
0.77
when
0.71
Newsweek
0.69
rumors
0.68
Attorney
0.67
VICE
0.67
tensions
0.67
researchers
0.66
Stephanie
0.65
WikiLeaks
0.65
Activations Density 2.497%