INDEX
    Explanations

    mentions of "Fox News" and its associated personalities

    New Auto-Interp
    Negative Logits
     Rudd
    -0.15
    her
    -0.15
    olle
    -0.15
    onica
    -0.14
    oop
    -0.14
    tte
    -0.14
    ecial
    -0.14
    ont
    -0.13
    ifers
    -0.13
    اضÙĬ
    -0.13
    POSITIVE LOGITS
    Compat
    0.15
    411
    0.14
    REW
    0.14
    ipsis
    0.14
     æĸ
    0.14
     paramName
    0.14
     comb
    0.13
    è
    0.13
    emetery
    0.13
    zilla
    0.13
    Act Density 0.014%

    No Known Activations