INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     étr
    -0.41
     trucos
    -0.38
     peccato
    -0.37
     Стали
    -0.36
    weird
    -0.35
     sofern
    -0.35
    Pilih
    -0.35
     jueces
    -0.35
    ilmente
    -0.35
    Necesito
    -0.35
    POSITIVE LOGITS
    Campus
    1.41
     Campus
    1.38
     campus
    1.37
     CAMPUS
    1.34
    campus
    1.34
     campuses
    1.12
    CAMP
    0.77
     kampus
    0.74
    ppus
    0.71
    校园
    0.69
    Act Density 0.002%

    No Known Activations