INDEX
    Explanations
    Negative Logits
    ñ               -0.08
    ור              -0.08
     znam           -0.08
     turist         -0.08
     aspek          -0.07
     Erit           -0.07
    Steph           -0.07
     quel           -0.07
     aspet          -0.07
     נח             -0.07
    Positive Logits
     наст           0.08
     brightness     0.07
     ول             0.07
     الاعت          0.07
     falsely        0.07
     guitar         0.07
     asumir         0.07
     мя             0.07
     fluorescence   0.07
    rast            0.07
    Activation Density: 0.094%

    No Known Activations