INDEX
    Explanations

    actions related to transformation and communication in various contexts

    New Auto-Interp
    Negative Logits
    yx
    -0.15
    lod
    -0.15
    oppins
    -0.14
    nici
    -0.14
    елен
    -0.14
    ubl
    -0.14
    ĽĦ
    -0.14
    onica
    -0.14
    inez
    -0.14
    quals
    -0.14
    POSITIVE LOGITS
    ãģĿãĤĮãģ¯
    0.25
     ones
    0.25
     doing
    0.23
    å®ĥ们
    0.22
     it
    0.22
     them
    0.20
     ê·¸ê²ĥ
    0.19
     оно
    0.19
    ones
    0.18
     itu
    0.18
    Act Density 0.023%

    No Known Activations