INDEX
    Explanations
    NEGATIVE LOGITS
    " Stevenson"      -0.08
    "itiro"           -0.08
    " ved"            -0.07
    (whitespace)      -0.07
    " Ern"            -0.07
    " Ned"            -0.07
    " Malmö"          -0.07
    "一下"            -0.07
    " Ded"            -0.07
    "currency"        -0.07
    POSITIVE LOGITS
    " ideeën"         0.08
    " Freude"         0.08
    " opdrachten"     0.07
    " incr"           0.07
    " qiym"           0.07
    " કિં"            0.07
    " certificates"   0.07
    " haine"          0.07
    " enthusiasm"     0.07
    "Apr"             0.07
    Activation Density: 0.021%

    No Known Activations