INDEX
    Explanations

    references to wolves and their ecological significance

    New Auto-Interp
    Negative Logits
     turtles
    -0.17
    egg
    -0.16
     frogs
    -0.16
     Eggs
    -0.16
    -shell
    -0.16
    urtle
    -0.16
    ç¾½
    -0.16
     Hatch
    -0.15
    ajar
    -0.15
     eggs
    -0.15
    POSITIVE LOGITS
     wolf
    0.57
     Wolf
    0.53
     wolves
    0.51
    wolf
    0.49
     wol
    0.48
    Wolf
    0.46
     Wolves
    0.44
    çĭ¼
    0.43
     pack
    0.42
     lup
    0.40
    Act Density 0.048%

    No Known Activations