INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    צ
    0.38
    ٠
    0.37
     rằng
    0.37
     utvik
    0.36
    زا
    0.35
     çık
    0.34
    ում
    0.34
    系統
    0.34
     udvik
    0.33
     bahwa
    0.33
    POSITIVE LOGITS
     names
    0.41
    handles
    0.40
     rufo
    0.40
    fruits
    0.39
    a
    0.39
    0.39
     названия
    0.38
     cloves
    0.38
     ASCII
    0.37
     რომელი
    0.37
    Act Density 0.041%

    No Known Activations