INDEX
    Explanations

    words and characters in diverse languages and scripts

    New Auto-Interp
    Negative Logits
     การ์ตูน
    -0.68
     erop
    -0.67
     Statt
    -0.66
     ente
    -0.65
     Theſe
    -0.60
    ώρα
    -0.58
     τὸ
    -0.57
     Wegen
    -0.57
     Việt
    -0.56
    Autoritní
    -0.55
    POSITIVE LOGITS
     प
    0.96
     ا
    0.96
     ब
    0.94
    UpInside
    0.93
     א
    0.90
     پ
    0.89
     ग
    0.89
     म
    0.89
     enää
    0.87
     löytyy
    0.85
    Act Density 0.040%

    No Known Activations