INDEX
    Explanations

    dollar signs indicating monetary values or references to currency

    New Auto-Interp
    Negative Logits
     مشين
    -0.94
     للمعارف
    -0.91
    олові
    -0.90
    Билгалдахарш
    -0.90
    rungsseite
    -0.89
    省市镇
    -0.87
    ंदीखरीदारी
    -0.86
    UnsafeEnabled
    -0.86
     Мексичка
    -0.84
    GEBURTSDATUM
    -0.84
    POSITIVE LOGITS
     way
    0.49
    وا
    0.44
    δι
    0.43
    ecap
    0.43
     modo
    0.42
     stream
    0.41
     lini
    0.40
     δι
    0.40
    0.40
    so
    0.39
    Act Density 0.012%

    No Known Activations