INDEX
    Explanations

    instances of contrasting viewpoints or actions leading to inconsistencies

    Negative Logits
    Token                Logit
    "umper"              -0.16
    ".vaadin"            -0.14
    "nown"               -0.14
    "жно"                -0.14
    " Ramp"              -0.14
    "bef"                -0.14
    "ropp"               -0.14
    "acket"              -0.14
    "pur"                -0.14
    "asmus"              -0.14

    Positive Logits
    Token                Logit
    "oe"                  0.17
    " MERCHANTABILITY"    0.15
    "571"                 0.14
    "och"                 0.14
    "ona"                 0.14
    "Readable"            0.14
    " resign"             0.14
    "atoi"                0.14
    " chatt"              0.13
    "undi"                0.13
    Activation Density: 0.333%

    No Known Activations