Explanations
words related to socio-political issues and current events
references to political and social issues
Negative Logits
.).                 -0.60
}.                  -0.55
guiActiveUnfocused  -0.54
?).                 -0.54
".                  -0.54
]."                 -0.54
$.                  -0.53
!".                 -0.51
)).                 -0.51
().                 -0.50
Positive Logits
':                  0.45
Grassley            0.43
Haz                 0.41
Profile             0.40
Lavrov              0.39
Updated             0.38
Investigative       0.37
ulative             0.37
Frontier            0.37
U                   0.37
Activations Density: 5.170%