INDEX
Explanations
occurrences of a specific word
mentions of a specific surname or term related to individuals involved in a news context
New Auto-Interp
Negative Logits
weeney
-0.77
xon
-0.72
urgical
-0.63
asers
-0.63
xual
-0.63
Caldwell
-0.61
bard
-0.60
Cobra
-0.59
appropriate
-0.59
SEC
-0.58
POSITIVE LOGITS
itives
1.17
erer
0.78
s
0.78
sworth
0.75
istani
0.73
aby
0.73
ief
0.73
ened
0.72
itive
0.72
ries
0.70
Activations Density 0.013%