INDEX
    Explanations

    elements related to changes in roles or responsibilities

    New Auto-Interp
    Negative Logits
    éri
    -0.16
    ensa
    -0.15
     Rhodes
    -0.15
    umin
    -0.14
    uhan
    -0.14
    utz
    -0.14
     Walton
    -0.14
     stos
    -0.13
    ssa
    -0.13
    ÑĥнкÑĤ
    -0.13
    POSITIVE LOGITS
     instead
    0.29
    instead
    0.27
     substituted
    0.21
     вмеÑģÑĤ
    0.21
     Instead
    0.20
     replacing
    0.18
    Instead
    0.18
     replace
    0.18
     replaced
    0.18
     substitute
    0.18
    Act Density 0.108%

    No Known Activations