INDEX
    Explanations

    references to thresholds and criteria in a context related to evaluation or measurement

    New Auto-Interp
    Negative Logits
    her
    -0.16
    arena
    -0.15
    ças
    -0.14
     Bain
    -0.14
    ifo
    -0.14
    CES
    -0.14
    orman
    -0.14
     nobody
    -0.14
    yp
    -0.14
     âĪ
    -0.13
    POSITIVE LOGITS
    istrovstvÃŃ
    0.16
    Ñĩен
    0.15
    odore
    0.15
    .Arg
    0.15
    baugh
    0.14
    oden
    0.14
    swith
    0.14
    .appspot
    0.14
     Kushner
    0.14
    ÏĢοÏĦε
    0.14
    Act Density 0.009%

    No Known Activations