INDEX
    Explanations

    negative connotations of terms

    Negative Logits
     второй  0.47
     deux  0.46
     second  0.46
     tercera  0.45
     Galle  0.45
     third  0.43
     segundo  0.41
     இரண்டு  0.41
     thứ  0.40
     two  0.39
    Positive Logits
    隐含  0.38
    monary  0.36
    তাসীন  0.35
     సంబంధించిన  0.35
    ্থিত  0.35
     accurately  0.34
    ledes  0.34
    পিড  0.34
    дикатор  0.34
     লুকিয়ে  0.34
    Act Density 0.000%

    No Known Activations