INDEX
    Explanations

    mentions of body parts

    instances of the word "ar" in various contexts

    New Auto-Interp
    Negative Logits
     Lumpur
    -0.87
    zinski
    -0.81
     Showdown
    -0.81
    rome
    -0.76
     Dew
    -0.76
    worth
    -0.73
    cade
    -0.70
     Mull
    -0.70
     Dull
    -0.67
     Rasmussen
    -0.66
    POSITIVE LOGITS
     ar
    3.69
     Ar
    1.60
    Ar
    1.57
     Archer
    1.29
     arch
    1.28
     AR
    1.16
     Ark
    1.16
     ank
    1.08
     arrow
    1.06
     bows
    1.06
    Act Density 0.011%

    No Known Activations