    Negative Logits
    Token             Logit
    " steady"         -0.07
    " simple"         -0.06
    "lap"             -0.06
    "Head"            -0.06
    " food"           -0.06
    "154"             -0.06
    " armor"          -0.06
    " obedient"       -0.06
    "arlo"            -0.06
    "getChild"        -0.06
    Positive Logits
    Token             Logit
    " disgusting"     0.13
    " disgust"        0.11
    " disgusted"      0.10
    "BUG"             0.08
    "hex"             0.07
    " IGN"            0.07
    " rev"            0.07
    (no token shown)  0.06
    " Baron"          0.06
    " Jenkins"        0.06
    Activation Density: 0.010%

    No Known Activations