INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     bateria
    -0.08
    国务院
    -0.08
     مني
    -0.08
    =Math
    -0.08
     oks
    -0.08
    lijnen
    -0.07
     منك
    -0.07
     संसद
    -0.07
     papir
    -0.07
    	dr
    -0.07
    POSITIVE LOGITS
     nick
    0.08
    0.08
     childish
    0.08
    yellow
    0.08
    .blue
    0.08
     yellow
    0.08
    (child
    0.08
     constructor
    0.08
    blue
    0.07
    Buzz
    0.07
    Act Density 0.007%

    No Known Activations