INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     كومونز
    -0.77
     betweenstory
    -0.63
     kohta
    -0.57
     */,
    -0.56
     صوتيه
    -0.52
    ++];
    -0.51
     marquées
    -0.50
     Normdatei
    -0.50
     rédaction
    -0.48
    -0.48
    POSITIVE LOGITS
     fun
    2.78
    fun
    2.24
    Fun
    2.13
     Fun
    2.12
     FUN
    2.02
     divertido
    1.79
     enjoyment
    1.78
    FUN
    1.78
     enjoyable
    1.76
     diversión
    1.67
    Act Density 0.196%

    No Known Activations