INDEX
Explanations
references to the Fox News network and its programming
"Fox" followed by a news-related word
Fox News or Business
New Auto-Interp
Negative Logits
RenderAtEndOf
-0.88
Efq
-0.82
myſelf
-0.80
writeFieldEnd
-0.79
GEBURTSDATUM
-0.79
messageInfo
-0.79
Jefus
-0.78
msgTypes
-0.78
contextLoads
-0.78
kasarigan
-0.78
POSITIVE LOGITS
<eos>
0.37
IONE
0.37
pá
0.37
bar
0.37
ged
0.37
the
0.36
旭
0.35
Of
0.35
cual
0.35
onne
0.35
Activations Density 0.568%