INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ,K
    -0.07
    bk
    -0.06
     Lunch
    -0.06
     телеф
    -0.06
     $↵↵
    -0.06
    ,!
    -0.06
     UIT
    -0.06
    /li
    -0.06
    enz
    -0.06
    aju
    -0.06
    POSITIVE LOGITS
    -century
    0.23
     história
    0.07
    mast
    0.07
     Middle
    0.07
     century
    0.07
     species
    0.07
    apor
    0.07
     ảnh
    0.07
    δης
    0.06
    Quest
    0.06
    Act Density 0.004%

    No Known Activations