INDEX
Explanations
terms and phrases related to political topics and discussions
New Auto-Interp
Negative Logits
Politics
-0.32
politics
-0.32
politic
-0.30
POLIT
-0.28
politically
-0.26
politik
-0.26
politique
-0.25
Political
-0.24
polÃŃtica
-0.24
political
-0.24
POSITIVE LOGITS
correctness
0.29
incorrect
0.26
Incorrect
0.23
Correct
0.21
/legal
0.20
incorrect
0.20
-economic
0.20
parties
0.20
æŃ£ç¡®
0.20
Parties
0.19
Activations Density 0.037%