INDEX
Explanations
names related to politics and legal matters
references to specific individuals, particularly Paul Manafort
New Auto-Interp
Negative Logits
RAL
-0.73
ALE
-0.72
CAST
-0.72
à¨
-0.71
ĪĴ
-0.71
ĸļ
-0.70
RD
-0.70
@#&
-0.70
ä¸ī
-0.66
NESS
-0.65
POSITIVE LOGITS
afort
1.24
Manafort
1.07
uchin
0.85
ossier
0.78
umenthal
0.76
iola
0.76
indicted
0.74
aide
0.73
confid
0.73
andowski
0.71
Activations Density 0.006%