INDEX
    Explanations

    mentions of the word "penguins"

    Negative Logits
    Token             Logit
    "Interstitial"    -0.82
    "nda"             -0.78
    "¿½"              -0.77
    "ysis"            -0.71
    "phas"            -0.68
    "nces"            -0.67
    "nea"             -0.66
    "guyen"           -0.65
    "puter"           -0.65
    "ameda"           -0.64
    Positive Logits
    Token             Logit
    " Penguins"       1.17
    "insula"          0.96
    "DragonMagazine"  0.94
    " pengu"          0.87
    " Hots"           0.83
    "éĹĺ"             0.80
    " Pengu"          0.77
    "ozo"             0.77
    " Sharks"         0.76
    "sburg"           0.74
    Activation Density: 0.011%

    No Known Activations