INDEX
    Explanations

    references to various bird species

    New Auto-Interp
    Negative Logits
     wol
    -0.19
     shark
    -0.18
     sharks
    -0.18
     Sharks
    -0.17
     Dogs
    -0.16
     canine
    -0.16
    çĬ¬
    -0.16
     dogs
    -0.16
     Dog
    -0.16
     puppy
    -0.16
    POSITIVE LOGITS
     bird
    0.47
     birds
    0.44
     Birds
    0.40
    birds
    0.40
    Bird
    0.40
    bird
    0.40
     Bird
    0.39
     пÑĤи
    0.34
    鸣
    0.31
    é³¥
    0.30
    Act Density 0.106%

    No Known Activations