INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.88
    0.80
     рӯ
    0.72
     Kamane
    0.68
    0.67
    alı
    0.67
    0.66
    <0x87>
    0.65
    0.65
    0.65
    POSITIVE LOGITS
     @
    5.41
    @
    5.28
     `@
    3.96
     (@
    3.78
    (@
    3.72
    ,@
    3.54
    .@
    3.53
     "@
    3.47
    \@
    3.44
     @_
    3.40
    Act Density 0.238%

    No Known Activations