INDEX
    Explanations

    pronoun and speech verb

    New Auto-Interp
    Negative Logits
    0.32
     दिखाता
    0.31
    0.31
     Questions
    0.30
    选项
    0.30
    ご覧
    0.30
    대로
    0.30
    を見て
    0.30
     समझते
    0.30
    Questions
    0.29
    POSITIVE LOGITS
     whispered
    0.61
     mus
    0.59
     murmured
    0.58
     said
    0.57
     declared
    0.56
     conceded
    0.56
     muttered
    0.56
     exclaimed
    0.55
     breathed
    0.55
     insisted
    0.55
    Act Density 0.014%

    No Known Activations