INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    werk
    -0.07
    šk
    -0.07
     buffalo
    -0.07
     Boo
    -0.07
    udad
    -0.07
    $/,↵
    -0.06
     Debian
    -0.06
     allegedly
    -0.06
    Campaign
    -0.06
     incididunt
    -0.06
    POSITIVE LOGITS
     CommandLine
    0.06
    [".
    0.06
    (regex
    0.06
     homosexuals
    0.06
    058
    0.06
     Foam
    0.06
     That
    0.06
     kid
    0.06
    -MM
    0.06
    	 		
    0.06
    Act Density 0.003%

    No Known Activations