INDEX
    Explanations

    expressions of support and mutual reliance among individuals in a community

    Reciprocal actions or relationships

    New Auto-Interp
    Negative Logits
    ьте
    -0.49
     M
    -0.48
     I
    -0.47
    <eos>
    -0.47
    -0.47
     T
    -0.46
    ios
    -0.43
     CONT
    -0.43
      
    -0.43
     (
    -0.42
    POSITIVE LOGITS
     mutual
    1.55
    mutual
    1.41
     mutually
    1.38
     nhau
    1.29
     eachother
    1.29
     saling
    1.29
    お互
    1.25
    Mutual
    1.23
     einander
    1.22
     друг
    1.18
    Act Density 0.355%

    No Known Activations