INDEX
Explanations
names of prominent figures in news and politics
occurrences of names and political references related to news coverage
New Auto-Interp
Negative Logits
osphere
-0.98
anguage
-0.92
ologies
-0.86
istries
-0.82
vertisement
-0.81
ocrine
-0.80
istic
-0.80
aily
-0.80
ually
-0.80
vironment
-0.78
POSITIVE LOGITS
chio
0.84
ARD
0.69
rict
0.69
minster
0.69
Abbey
0.67
pants
0.66
Isles
0.64
velt
0.63
arded
0.63
isSpecialOrderable
0.62
Activations Density 0.071%