INDEX
Explanations
news headlines or article titles ending with "Read more" links
high activation values indicating certain topics or themes related to news and current events
Negative Logits
mosqu      -0.71
hooked     -0.69
answ       -0.66
Ͻ          -0.66
honored    -0.61
homebrew   -0.60
XL         -0.59
mbuds      -0.59
Compact    -0.59
scill      -0.58
Positive Logits
than       0.87
Comments   0.77
perse      0.75
rug        0.70
dar        0.70
prev       0.69
fal        0.68
origin     0.67
notations  0.67
Fra        0.67
Activations Density: 0.059%