    NEGATIVE LOGITS
    互相  1.89
     mutually  1.76
    相互  1.76
     saling  1.73
     mutual  1.69
     서로  1.69
    彼此  1.65
    Mutual  1.61
     birbir  1.56
    mutual  1.49
    POSITIVE LOGITS
     detachment  0.81
    լ  0.75
     depth  0.74
    сей  0.73
     extensa  0.72
    дж  0.72
     muchas  0.71
     profund  0.70
     considérable  0.70
     integration  0.69
    Activation Density: 0.009%

    No Known Activations