INDEX
    Explanations

    phrases related to individual responsibility and awareness in decision-making

    New Auto-Interp
    Negative Logits
    ùng
    -0.15
    ReadOnly
    -0.15
    iegel
    -0.14
    ÑĤик
    -0.14
    edar
    -0.14
    ants
    -0.13
    .must
    -0.13
    IGHL
    -0.13
    iffe
    -0.13
     ReadOnly
    -0.13
    POSITIVE LOGITS
     nicht
    0.46
    ä¸įä¼ļ
    0.45
     not
    0.44
    ä¸įæĺ¯
    0.42
     niet
    0.42
     não
    0.41
     không
    0.40
     не
    0.40
     neither
    0.38
     doesn
    0.38
    Act Density 0.360%

    No Known Activations