INDEX
    Explanations

    references to the Fox News network and its programming

    "Fox" followed by a news-related word

    Negative Logits
    RenderAtEndOf   -0.88
     Efq            -0.82
     myſelf         -0.80
    writeFieldEnd   -0.79
    GEBURTSDATUM    -0.79
    messageInfo     -0.79
     Jefus          -0.78
    msgTypes        -0.78
     contextLoads   -0.78
     kasarigan      -0.78
    Positive Logits
    <eos>   0.37
    IONE    0.37
     pá     0.37
     bar    0.37
     ged    0.37
     the    0.36
            0.35
     Of     0.35
     cual   0.35
    onne    0.35
    Activation Density 0.568%

    No Known Activations