INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Qualification
    -0.09
    typeparam
    -0.08
     Mathematics
    -0.08
     qualification
    -0.08
     Volleyball
    -0.08
    Qualification
    -0.08
     volleyball
    -0.08
     Academic
    -0.08
     konfer
    -0.07
    Academic
    -0.07
    POSITIVE LOGITS
     negative
    0.11
     negativo
    0.11
     negativa
    0.11
    negative
    0.10
    address
    0.10
     negatieve
    0.09
     negativ
    0.09
     address
    0.09
     negativity
    0.09
     Negative
    0.09
    Act Density 0.028%

    No Known Activations