INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.45
     आयकर
    0.41
    0.40
    0.40
    0.39
    збе
    0.39
    0.39
    注明
    0.38
    ພາບ
    0.37
    Œ
    0.37
    POSITIVE LOGITS
     kw
    0.41
    ma
    0.36
    ula
    0.35
     woman
    0.35
    kw
    0.35
     Kw
    0.35
    <0xEA>
    0.34
    гъ
    0.34
     people
    0.34
    nu
    0.33
    Act Density 0.008%

    No Known Activations