    Negative Logits
     Neue               -0.07
    hiro                -0.07
     Partition          -0.06
    Pear                -0.06
    postgresql          -0.06
    bff                 -0.06
     Heater             -0.06
    sie                 -0.06
     professionalism    -0.06
     Uma                -0.06
    Positive Logits
    	strncpy         0.08
     збір               0.07
    _item               0.07
     eso                0.06
                        0.06
     inconsistent       0.06
                        0.06
     >",                0.06
    	direction       0.06
                        0.06
    Act Density 0.013%

    No Known Activations