INDEX
Explanations
political terms or concepts
mentions of political issues and concepts
New Auto-Interp
Negative Logits
upon
-0.75
Om
-0.71
shall
-0.71
Landing
-0.68
tered
-0.68
ance
-0.68
Trinity
-0.68
Peaks
-0.68
Shannon
-0.68
anta
-0.67
POSITIVE LOGITS
affili
0.89
correctness
0.88
disag
0.84
correct
0.82
politically
0.80
minded
0.79
aven
0.77
Pengu
0.76
oriented
0.75
savvy
0.74
Activations Density 0.005%