INDEX
    Explanations

    pronoun followed by verb

    New Auto-Interp
    Negative Logits
    reserved
    0.41
     Cannot
    0.41
    最適な
    0.39
     nothing
    0.38
     Preserve
    0.38
    izontally
    0.38
     সুরক্ষিত
    0.38
     Reserved
    0.38
     Silence
    0.38
     preserving
    0.37
    POSITIVE LOGITS
     بوده
    0.46
    だった
    0.45
     হবার
    0.44
     आहे
    0.43
     становится
    0.43
     ናቸው
    0.43
    であった
    0.43
     olduk
    0.42
     olma
    0.42
    0.42
    Act Density 0.086%

    No Known Activations