INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     alma
    -0.07
    _activation
    -0.07
     Prec
    -0.06
     zx
    -0.06
    -zero
    -0.06
    λι
    -0.06
    -0.06
     hình
    -0.06
     Shelter
    -0.06
    eca
    -0.06
    POSITIVE LOGITS
     flowing
    0.07
    owing
    0.07
     crowds
    0.07
     accession
    0.07
     لـ
    0.06
    .signup
    0.06
    0.06
    }
    
    ↵
    0.06
    :H
    0.06
    0.06
    Act Density 0.005%

    No Known Activations