INDEX
    Explanations

    The word "simply" and its variants, used to signal that an explanation is simple or straightforward

    Negative Logits
    Token             Logit
    "ISupport"        -1.00
    " culturelles"    -0.84
    " regardant"      -0.84
    " colorés"        -0.84
    " Gallardo"       -0.83
    " debout"         -0.82
    " passim"         -0.82
    "timbangkan"      -0.82
    " atteinte"       -0.81
    " préfé"          -0.81
    Positive Logits
    Token             Logit
    " SIMPLE"         0.97
    "simpleType"      0.96
    " Simply"         0.92
    "Simply"          0.92
    " simply"         0.86
    " Simple"         0.86
    "PLY"             0.85
    " simple"         0.84
    "simply"          0.80
    "er"              0.77
    Activation Density: 0.095%

    No Known Activations