INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    pop
    -0.06
    lásil
    -0.06
    -0.06
     покры
    -0.06
     Not
    -0.06
    "Not
    -0.06
     ya
    -0.06
     not
    -0.06
     deposits
    -0.06
    ัปดาห
    -0.06
    POSITIVE LOGITS
    ine
    0.21
    INE
    0.17
    ines
    0.14
    INES
    0.12
    nine
    0.10
     Nine
    0.10
    ini
    0.10
    wine
    0.10
    ive
    0.10
    anine
    0.10
    Act Density 0.024%

    No Known Activations