INDEX
    Explanations

    describing characteristics or states

    New Auto-Interp
    Negative Logits
     campaigners
    0.54
     коже
    0.53
     ayudan
    0.52
     pomoći
    0.51
     alertas
    0.50
     anticor
    0.50
     molécules
    0.50
     atacar
    0.50
     elytris
    0.49
     QnrB
    0.49
    POSITIVE LOGITS
    0.53
    Bank
    0.48
    3
    0.48
     Square
    0.47
     Room
    0.45
     Airport
    0.44
     Bank
    0.43
     Mary
    0.43
    ond
    0.43
     Boom
    0.43
    Act Density 0.006%

    No Known Activations