INDEX
    Explanations

    French language

    New Auto-Interp
    Negative Logits
     CENTER
    -0.08
     tilt
    -0.07
    Both
    -0.07
    ’é
    -0.07
    ีก
    -0.07
    سمبر
    -0.06
    CENTER
    -0.06
    NER
    -0.06
    isk
    -0.06
     Pulse
    -0.06
    POSITIVE LOGITS
     cao
    0.07
     della
    0.07
    Joseph
    0.07
     definite
    0.07
     تصم
    0.07
    	Duel
    0.06
     các
    0.06
     nejd
    0.06
    rof
    0.06
    fdf
    0.06
    Act Density 0.029%

    No Known Activations