INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     فکر
    -0.07
     fell
    -0.07
    ascular
    -0.07
    ancial
    -0.07
    ACITY
    -0.07
    lüğ
    -0.07
    .mybatis
    -0.07
     marrying
    -0.06
    -0.06
     answering
    -0.06
    POSITIVE LOGITS
     unused
    0.09
     Unused
    0.07
    .range
    0.07
    :e
    0.06
    _disc
    0.06
    Ž
    0.06
     než
    0.06
    ()._
    0.06
    086
    0.06
     jed
    0.06
    Act Density 0.003%

    No Known Activations