INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     انك
    0.53
     ganglions
    0.53
     ordres
    0.50
    ກັບ
    0.50
     parecido
    0.49
     тысяч
    0.48
     სახელმწიფო
    0.47
     কাজল
    0.47
     މ
    0.47
    දි
    0.47
    POSITIVE LOGITS
    記事
    0.49
    álov
    0.46
     circle
    0.45
     historian
    0.44
     century
    0.43
     museum
    0.43
     Museum
    0.42
     minuman
    0.41
     mixture
    0.41
    0.41
    Act Density 0.001%

    No Known Activations