INDEX
    Explanations

    events or situations involving bears

    New Auto-Interp
    Negative Logits
     Butterfly
    -0.17
    rido
    -0.17
    æī¶
    -0.16
     lizard
    -0.16
     Pony
    -0.15
     poultry
    -0.15
     Millet
    -0.15
    ogra
    -0.15
     frog
    -0.15
     serpent
    -0.14
    POSITIVE LOGITS
     bears
    0.52
     bear
    0.52
     Bears
    0.44
     Bear
    0.44
    bear
    0.39
    Bear
    0.39
    çĨĬ
    0.37
     cub
    0.33
     Urs
    0.30
     polar
    0.29
    Act Density 0.020%

    No Known Activations