INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     prizes
    -1.29
     prize
    -1.19
     winners
    -1.09
     Prizes
    -1.07
     winning
    -1.05
     Prize
    -1.05
     awards
    -1.04
     premios
    -1.03
     winner
    -1.02
     premio
    -1.00
    POSITIVE LOGITS
     déf
    0.40
     sàng
    0.39
    estination
    0.39
     sû
    0.39
     phthal
    0.39
     defaultstate
    0.38
     aéri
    0.37
    FL
    0.36
    denn
    0.36
    ziej
    0.35
    Act Density 0.002%

    No Known Activations