INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.83
    <0x0D>
    0.63
    5
    0.62
    ra
    0.58
     mesmos
    0.56
    <strong>
    0.55
    ιλ
    0.55
    ی
    0.54
    ވާ
    0.54
    in
    0.53
    POSITIVE LOGITS
     time
    1.23
    Time
    1.15
     समय
    1.15
     Time
    1.14
     čas
    1.06
     टाइम
    1.02
     시간
    1.00
     waktu
    0.96
    時間
    0.96
    TIME
    0.94
    Act Density 0.081%

    No Known Activations