INDEX
    Explanations

    multilingual text, code, and technical descriptions

    New Auto-Interp
    Negative Logits
     rebels
    0.83
     singing
    0.75
     rebellion
    0.75
     response
    0.73
     gesture
    0.73
     games
    0.72
    वाने
    0.72
     surprising
    0.72
     laughing
    0.72
     game
    0.71
    POSITIVE LOGITS
    0.90
    英文
    0.89
    <unused52>
    0.87
    িত্ব
    0.85
     Interpretation
    0.85
    ործ
    0.84
     beschäftigen
    0.83
    考察
    0.83
    0.83
    ्युअर
    0.83
    Act Density 0.167%

    No Known Activations