INDEX
    Explanations

    terms or phrases related to user interface elements and button interactions

    Negative Logits
    " épaules"        -0.78
    " vérit"          -0.76
    " pouvoirs"       -0.74
    " lèvres"         -0.74
    " négociations"   -0.71
    " vœux"           -0.71
    " gouvernements"  -0.70
    " boî"            -0.68
    " leçons"         -0.67
    " genoux"         -0.66
    Positive Logits
    " these"          0.72
    "these"           0.65
    "tols"            0.63
    " sánh"           0.61
    " those"          0.60
    " theses"         0.60
    "iths"            0.59
    "nessed"          0.58
    "leps"            0.57
    "dities"          0.57
    Activation Density 0.771%

    No Known Activations