INDEX
Explanations
references to advertising and advertisers
Negative Logits
ologies       -0.77
Patriarch     -0.72
VK            -0.72
APS           -0.72
arity         -0.71
stood         -0.69
confinement   -0.65
ologically    -0.64
surviving     -0.64
turtle        -0.61
Positive Logits
vertis        1.09
advertisers   0.95
vertising     0.93
eering        0.91
elaide        0.87
advertis      0.85
ounce         0.84
elson         0.81
Advertising   0.75
ele           0.74
Activations Density 0.006%