INDEX
Explanations
names of individuals involved in potentially controversial or criminal activities
phrases about criminal activities or legal issues
New Auto-Interp
Negative Logits
etheless
-1.01
EStream
-0.88
cius
-0.86
odox
-0.84
awei
-0.76
characterize
-0.75
summarize
-0.74
alach
-0.73
favorably
-0.73
cellaneous
-0.73
POSITIVE LOGITS
Pict
0.95
ITV
0.91
Notting
0.88
NHS
0.82
DUP
0.82
!'
0.79
NRL
0.79
footballer
0.79
Liverpool
0.78
Tory
0.77
Activations Density 0.136%