INDEX
    Explanations

    dialogue and translation prompts

    Negative Logits
    " need"          1.26
    " needs"         1.21
    " cần"           1.21
    " needed"        1.18
    "need"           1.13
    "needed"         1.12
    "needs"          1.10
    " необходимо"    1.09
    "Need"           1.06
    "需要在"          1.03
    Positive Logits
    " あなた"         0.80
    " USING"         0.77
    " mittens"       0.73
    (blank token)    0.73
    " Gossip"        0.73
    " 😏"            0.72
    " BECAUSE"       0.71
    " dones"         0.71
    " Using"         0.71
    " partners"      0.70
    Act Density 0.116%

    No Known Activations