INDEX
    Explanations
    Negative Logits
    " européennes"    -0.84
    " religieux"      -0.81
    " médicaux"       -0.81
    " Erie"           -0.79
    " lupo"           -0.78
    " Monfieur"       -0.77
    " extérieures"    -0.77
    " vectorielles"   -0.76
    " sexuales"       -0.75
    " Vicenza"        -0.74
    Positive Logits
    " still"    2.38
    "still"     2.27
    "STILL"     2.25
    " Still"    2.23
    "Still"     2.14
    " STILL"    2.04
    " masih"    1.38
    "avía"      1.38
    " Stil"     1.34
    " stil"     1.29
    Activation Density: 0.056%

    No Known Activations