INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     paradigma
    -0.69
     colch
    -0.68
     psicologia
    -0.66
     rimb
    -0.66
     cristian
    -0.65
     masaj
    -0.63
     feltro
    -0.62
     balon
    -0.61
     rosas
    -0.61
     antropo
    -0.61
    POSITIVE LOGITS
     reluct
    1.72
     disagre
    1.64
     unwarran
    1.58
     pamph
    1.53
     apprehen
    1.52
     inev
    1.52
     emphat
    1.51
     desir
    1.50
     depic
    1.49
     impra
    1.46
    Act Density 0.394%

    No Known Activations