INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    chtig
    -0.48
     Leal
    -0.46
    たった
    -0.46
    roek
    -0.45
     Safer
    -0.45
     betweenstory
    -0.44
    carpa
    -0.44
    一枚
    -0.44
    geta
    -0.44
     disambiguazione
    -0.44
    POSITIVE LOGITS
     wind
    2.11
     Wind
    1.83
    Wind
    1.81
     WIND
    1.65
    wind
    1.65
    WIND
    1.42
     winds
    1.38
     viento
    1.34
     Winds
    1.18
    1.16
    Act Density 0.006%

    No Known Activations