INDEX
    Explanations

    references to animals and their well-being

    New Auto-Interp
    Negative Logits
    unda
    -0.18
    isa
    -0.16
     Eins
    -0.15
    urf
    -0.15
    shot
    -0.15
    over
    -0.15
    رÙĪÛĮ
    -0.15
    Rob
    -0.15
    uts
    -0.15
     Rox
    -0.15
    POSITIVE LOGITS
    afone
    0.18
    òi
    0.17
    ScreenState
    0.16
    elijk
    0.16
    _hal
    0.15
    KeyCode
    0.15
    geist
    0.15
     Bilg
    0.15
    isÃŃ
    0.15
     hal
    0.14
    Act Density 0.034%

    No Known Activations