    Negative Logits
        "imbatore"  0.46
        " Midway"  0.45
        " ছই"  0.43
        " प्रतिभाग"  0.42
        "🏜"  0.41
        " அதிமுக"  0.40
        " तमिलनाडु"  0.39
        "सिंग"  0.39
        "美元"  0.39
        "🐅"  0.38
    Positive Logits
        " Belgian"  2.50
        " Belgium"  2.45
        "Belgium"  2.20
        " belg"  2.09
        " Belgi"  2.05
        " Belgien"  2.05
        " België"  1.96
        " Belgique"  1.95
        " Antwerp"  1.93
        " Brussels"  1.92
    Activation Density: 0.010%

    No Known Activations