INDEX
Explanations
mentions of magazines
references to magazines and their related content
New Auto-Interp
Negative Logits
acted
-0.72
cker
-0.68
ident
-0.68
weak
-0.63
ensions
-0.62
speaking
-0.61
ple
-0.61
heed
-0.59
ctive
-0.58
iago
-0.58
POSITIVE LOGITS
azine
1.02
subscriptions
0.95
publisher
0.92
azines
0.88
Seym
0.87
publishers
0.84
magazines
0.81
covers
0.80
magazine
0.79
editors
0.79
Activations Density 0.029%