INDEX
    Explanations

    phrases that emphasize the concept of reaching out or making contact with others

    New Auto-Interp
    Negative Logits
    ieur
    -0.17
    aji
    -0.17
    ะ
    -0.15
    uros
    -0.14
    avel
    -0.14
    quette
    -0.14
    adero
    -0.14
     Canter
    -0.14
    ãĥ¼ãĥĦ
    -0.14
    å¥ı
    -0.14
    POSITIVE LOGITS
     reach
    0.32
     reaching
    0.30
    reach
    0.29
    Reach
    0.27
     Reach
    0.27
     reached
    0.25
     reaches
    0.24
     outreach
    0.23
    reachable
    0.23
     out
    0.23
    Act Density 0.024%

    No Known Activations