INDEX
    Explanations

    references to specific sports teams, particularly the Atlanta Falcons and the Atlanta Braves

    references to specific sports teams, particularly the Atlanta Falcons and Braves

    New Auto-Interp
    Negative Logits
    lying
    -0.73
     Kardashian
    -0.69
    ocard
    -0.67
     srfAttach
    -0.66
     Niet
    -0.65
    pert
    -0.65
     Cao
    -0.63
    zanne
    -0.63
     dele
    -0.62
    tics
    -0.61
    POSITIVE LOGITS
     Falcons
    1.22
     Braves
    0.91
    layer
    0.82
    daq
    0.82
     Buccaneers
    0.81
     Hawks
    0.78
    ipeg
    0.78
    ï¸
    0.77
    BSD
    0.77
    Vision
    0.69
    Act Density 0.011%

    No Known Activations