INDEX
Explanations
news headlines or article titles that prompt the reader to "Read more."
instances of the word "Read," indicating sources or references for further information
Negative Logits
xon        -0.82
opard      -0.70
IDS        -0.70
IDA        -0.67
ounty      -0.65
TEXTURE    -0.65
OPER       -0.64
adish      -0.64
uay        -0.64
ascal      -0.63
Positive Logits
aloud      0.98
sburg      0.93
Write      0.86
iness      0.83
ahead      0.83
Read       0.83
just       0.80
gon        0.80
ying       0.78
scl        0.78
Activations Density 0.014%