INDEX
    Explanations

    Code/Data parameters

    New Auto-Interp
    Negative Logits
     reven
    -0.07
     पेश
    -0.07
     modifiers
    -0.07
    odne
    -0.07
     satt
    -0.07
     prosecutor
    -0.07
     ود
    -0.07
     rollover
    -0.07
     nack
    -0.07
     linking
    -0.07
    POSITIVE LOGITS
    0.11
    0.09
    Dot
    0.09
    0.09
    -feira
    0.08
    г
    0.08
    0.08
    °↵↵
    0.08
    Statement
    0.08
    ء
    0.08
    Act Density 0.439%

    No Known Activations