INDEX
    Explanations

    verbs indicating possibility or capability

    New Auto-Interp
    Negative Logits
    Ú©ÙĨ
    -0.16
    aben
    -0.16
    ieber
    -0.15
    al
    -0.15
    ake
    -0.15
    HM
    -0.14
    889
    -0.14
    fail
    -0.14
    hm
    -0.14
    nod
    -0.14
    POSITIVE LOGITS
     Raphael
    0.15
    ucas
    0.15
    uç
    0.15
    ANGLES
    0.15
    .nt
    0.15
     Ñĥж
    0.14
    ibble
    0.14
     worse
    0.14
    eds
    0.14
    enarios
    0.14
    Act Density 0.054%

    No Known Activations