INDEX
Explanations
mentions of "Fox News" and its associated personalities
New Auto-Interp
Negative Logits
Rudd
-0.15
her
-0.15
olle
-0.15
onica
-0.14
oop
-0.14
tte
-0.14
ecial
-0.14
ont
-0.13
ifers
-0.13
اضÙĬ
-0.13
POSITIVE LOGITS
Compat
0.15
411
0.14
REW
0.14
ipsis
0.14
æĸ
0.14
paramName
0.14
comb
0.13
è
0.13
emetery
0.13
zilla
0.13
Activations Density 0.014%