INDEX
Explanations
proper nouns, specifically the name "AMY GOODMAN" with varying activations within the text
mentions of the person Amy Goodman
New Auto-Interp
Negative Logits
urger
-0.76
alli
-0.69
thro
-0.68
separ
-0.68
antic
-0.67
hered
-0.62
revenge
-0.62
rating
-0.62
nu
-0.61
bowl
-0.61
POSITIVE LOGITS
GOODMAN
2.05
ertodd
0.91
GROUND
0.84
WATCHED
0.83
COVER
0.83
IFIED
0.81
EDITION
0.79
WATCH
0.76
PRESIDENT
0.76
ONSORED
0.75
Activations Density 0.015%