INDEX
    Explanations

    verbs related to actions or tasks that indicate progress or performance

    New Auto-Interp
    Negative Logits
    èĥ½
    -0.21
    èĥ½å¤Ł
    -0.18
    æľį
    -0.16
    çĦ¶
    -0.15
     frequ
    -0.15
    orge
    -0.15
     kunnen
    -0.14
    blr
    -0.14
    AEA
    -0.14
    æķ¢
    -0.14
    POSITIVE LOGITS
     feas
    0.22
     easily
    0.22
     anytime
    0.22
    à¹Ħà¸Ķ
    0.19
    à¹Įà¹Ħà¸Ķ
    0.17
    ä¸Ģä¸ĭ
    0.17
    inx
    0.17
     Easily
    0.17
     safely
    0.16
    ัà¸Ļà¹Ħà¸Ķ
    0.16
    Act Density 0.868%

    No Known Activations