Explanations
proper nouns or titles typically featured in news articles
references to significant events or crises
Negative Logits
oise        -0.81
tons        -0.64
opsis       -0.64
manufact    -0.63
osaurus     -0.63
Fine        -0.63
ilo         -0.62
Doodle      -0.62
Pony        -0.62
ean         -0.61
Positive Logits
ccording    0.80
BBC         0.71
jer         0.67
accompan    0.66
Bus         0.66
Laura       0.66
dan         0.65
Posted      0.64
inav        0.64
bishops     0.62
Activations Density: 0.129%