INDEX
    Explanations

    reflective character and observation

    Negative Logits
                 0.67
                 0.52
                 0.50
                 0.48
    "ی"          0.47
                 0.45
    "ча"         0.45
    "{"          0.42
                 0.42
    " aquela"    0.41
    Positive Logits
    " konfl"     0.51
    "kaar"       0.49
    " thinker"   0.47
    "obus"       0.47
    " prover"    0.47
    " Lauf"      0.47
    "rmsg"       0.46
    " صبر"       0.46
    "vern"       0.46
    " انصاف"     0.46
    Act Density 0.001%

    No Known Activations