INDEX
    Explanations

    references to current events or controversial topics mentioned in news articles

    New Auto-Interp
    Negative Logits
    çīĪ
    -0.73
    OTOS
    -0.68
     Achievements
    -0.67
     Anxiety
    -0.66
     Bakr
    -0.65
     Ces
    -0.64
     Odyssey
    -0.64
     Emirates
    -0.63
     Calculator
    -0.63
     Administ
    -0.63
    POSITIVE LOGITS
    skinned
    1.27
    colored
    1.22
    haired
    1.17
    washed
    1.12
    collar
    1.12
    legged
    1.09
    backed
    1.08
    eyed
    1.07
    bodied
    1.06
    oak
    1.06
    Act Density 0.095%

    No Known Activations