INDEX
    Explanations

    modal verbs expressing possibility, necessity, or permission

    modal verbs and expressions of capability or possibility

    New Auto-Interp
    Negative Logits
     Echoes
    -0.67
    hler
    -0.66
     Plane
    -0.64
    DIV
    -0.64
    onductor
    -0.63
     Nanto
    -0.62
     Wage
    -0.61
     Drain
    -0.61
     Orchestra
    -0.61
     Mash
    -0.59
    POSITIVE LOGITS
    able
    0.92
    runs
    0.83
    tan
    0.81
    anc
    0.79
    iful
    0.78
    full
    0.77
    ier
    0.77
    rams
    0.77
    fully
    0.75
    oub
    0.75
    Act Density 0.611%

    No Known Activations