INDEX
Explanations
references to the news network FOX
references to the FOX news network
New Auto-Interp
Negative Logits
Pg
-0.69
Ach
-0.68
Sov
-0.64
ctor
-0.64
Stack
-0.63
unres
-0.63
Acknowled
-0.62
subsistence
-0.62
immersion
-0.62
ococ
-0.61
POSITIVE LOGITS
FOX
3.79
FOX
2.83
Fox
1.83
Fox
1.80
CBS
1.57
NBC
1.50
ABC
1.37
CBS
1.34
NBC
1.33
fox
1.28
Activations Density 0.016%