INDEX
    Explanations

    negations and words that indicate lack or absence

    New Auto-Interp
    Negative Logits
    znam
    -0.16
    lys
    -0.14
    439
    -0.14
    arkin
    -0.14
     spared
    -0.14
    оÑĤÑĥ
    -0.14
    plete
    -0.13
    rello
    -0.13
    intel
    -0.13
    htub
    -0.13
    POSITIVE LOGITS
     DISCLAIM
    0.19
    .scalablytyped
    0.18
    oha
    0.17
    AtA
    0.16
    .AutoComplete
    0.16
    keh
    0.15
    cki
    0.14
    asi
    0.14
    igin
    0.14
    pod
    0.14
    Act Density 0.096%

    No Known Activations