INDEX
    Explanations

    instances where something is clearly described or evident

    explicit indications of certainty or clarity

    New Auto-Interp
    Negative Logits
    lé
    -0.72
    rey
    -0.71
    lav
    -0.70
    oleon
    -0.69
    aily
    -0.69
    ucky
    -0.68
    ourke
    -0.68
    RY
    -0.68
    imer
    -0.67
    uese
    -0.66
    POSITIVE LOGITS
     deline
    0.96
     distinguish
    0.91
     identifiable
    0.86
     differentiated
    0.84
     marked
    0.83
     spelled
    0.81
     differentiate
    0.81
     distinguishing
    0.79
     outwe
    0.79
     distinguishes
    0.78
    Act Density 0.017%

    No Known Activations