INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ――
    -0.07
    oline
    -0.07
     };
    -0.07
     Overview
    -0.07
    guard
    -0.06
    	ERR
    -0.06
     }:
    -0.06
     libero
    -0.06
     поч
    -0.06
    shield
    -0.06
    POSITIVE LOGITS
     zda
    0.07
     nemoh
    0.06
    0.06
     Qty
    0.06
    0.06
     Shepherd
    0.06
     trà
    0.06
     Meer
    0.06
     Watts
    0.06
    فس
    0.06
    Act Density 0.012%

    No Known Activations