INDEX
    Explanations

    emotional expressions and personal connections

    New Auto-Interp
    Negative Logits
     שוליים
    -0.59
    principalColumn
    -0.58
    providedIn
    -0.58
     Dist
    -0.48
    findpost
    -0.46
    TagMode
    -0.44
    生平
    -0.42
     dist
    -0.41
    ագրություններ
    -0.40
     DIST
    -0.39
    POSITIVE LOGITS
     your
    0.66
     EconPapers
    0.59
     noDo
    0.57
     yourself
    0.54
    your
    0.51
     vaš
    0.51
     yourselves
    0.47
     ваших
    0.46
     yours
    0.46
     darte
    0.45
    Act Density 0.734%

    No Known Activations