    Explanations

    actions and verbs related to communication and interaction

    Following transitive verbs

    Negative Logits
     تضيفلها    -0.76
     ſch        -0.66
     ſal        -0.63
     ſta        -0.62
     perfons    -0.62
    참고         -0.61
    ngdoc       -0.61
     ſu         -0.61
     umano      -0.61
     viſ        -0.59
    Positive Logits
     em            1.11
    Em             0.79
     her           0.70
     Em            0.67
    em             0.64
     ya            0.64
     it            0.64
     THOSE         0.64
     everything    0.63
     ça            0.63
    Activation Density: 0.282%

    No Known Activations