INDEX
    Explanations

    critical thresholds

    New Auto-Interp
    Negative Logits
    //*
    -0.08
    Bien
    -0.08
     Malaga
    -0.08
    fed
    -0.07
    Christian
    -0.07
     tranquilo
    -0.07
     intends
    -0.07
     Mediterr
    -0.07
     siglo
    -0.07
    이번
    -0.07
    POSITIVE LOGITS
     threshold
    0.19
     Threshold
    0.18
     thresholds
    0.18
    Threshold
    0.16
    _threshold
    0.16
    threshold
    0.15
     seuil
    0.14
    _THRESHOLD
    0.14
    .threshold
    0.14
     thresh
    0.13
    Act Density 0.030%

    No Known Activations