    NEGATIVE LOGITS
    Token                  Logit
    "Documented"           -0.49
    "хьтан"                -0.46
    "ņas"                  -0.46
    "pecabe"               -0.45
    "Życiorys"             -0.45
    "penuhi"               -0.45
    " cueva"               -0.45
    "addPreferredGap"      -0.45
    " Dunlap"              -0.44
    " bruja"               -0.44
    POSITIVE LOGITS
    Token                  Logit
    " motion"              1.00
    " Motion"              0.88
    "motion"               0.81
    " MOTION"              0.80
    "Motion"               0.79
    " distinction"         0.77
    " ratio"               0.76
    " Ratio"               0.69
    "ratio"                0.64
    " Distinction"         0.62
    Activation Density: 0.097%

    No Known Activations