INDEX
    Explanations

    scientific texts

    NEGATIVE LOGITS
     Hav            -0.07
     beautifully    -0.07
                    -0.07
     vowel          -0.07
     Begins         -0.07
    line            -0.06
    uilt            -0.06
     Ship           -0.06
    ummings         -0.06
                    -0.06
    POSITIVE LOGITS
     Autor          0.07
    abilir          0.07
     frustr         0.06
    sah             0.06
     прож           0.06
     objekt         0.06
     gül            0.06
    _ISO            0.06
    uala            0.06
    บรร             0.06
    Act Density 0.070%

    No Known Activations