INDEX
    NEGATIVE LOGITS
    Token            Logit
    "histoire"       -0.06
    "104"            -0.06
    " bào"           -0.06
    " theatre"       -0.06
    "Array"          -0.06
    "\tArrayList"    -0.06
    "esity"          -0.06
    " Куб"           -0.06
    " теат"          -0.06
    "_red"           -0.06
    POSITIVE LOGITS
    Token            Logit
    " pin"           0.10
    ".Pin"           0.09
    "Pin"            0.09
    " pins"          0.08
    " Pin"           0.08
    "pin"            0.07
    "μα"             0.07
    " anchored"      0.07
    "-ps"            0.07
    " Penalty"       0.07
    Act Density 0.009%

    No Known Activations