INDEX
    Explanations
    No Explanations Found
    Negative Logits
    Token            Logit
    " positive"      -1.20
    "positive"       -1.13
    " Positive"      -1.12
    "Positive"       -1.04
    " POSITIVE"      -1.02
    " positively"    -0.96
    " positif"       -0.91
    "POSITIVE"       -0.87
    " positives"     -0.86
    " positivas"     -0.86
    Positive Logits
    Token            Logit
    "Geographie"     0.49
    "lename"         0.46
    "ebra"           0.44
    " للمعارف"       0.42
    " religion"      0.41
    " revanche"      0.39
    " يتيمه"         0.38
    " πάντα"         0.38
    "рії"            0.36
    "strtotime"      0.36
    Activation Density: 0.003%

    No Known Activations