INDEX
    Explanations

    specific patterns of characters or symbols

    New Auto-Interp
    Negative Logits
    argo
    -0.17
    asn
    -0.16
    urrent
    -0.14
    _RELEASE
    -0.14
    iв
    -0.14
     Current
    -0.13
    çīĮ
    -0.13
    mark
    -0.13
    Compilation
    -0.13
    noch
    -0.13
    POSITIVE LOGITS
     themselves
    0.40
     itself
    0.36
     himself
    0.36
     herself
    0.35
     Himself
    0.32
     yourself
    0.31
     oneself
    0.31
     ourselves
    0.30
     myself
    0.29
     yourselves
    0.29
    Act Density 0.009%

    No Known Activations