INDEX
    Explanations

    phrases indicating absence or non-existence

    "None" and its variations

    New Auto-Interp
    Negative Logits
     pleaſure
    -0.51
    江苏
    -0.51
    tshire
    -0.50
    bewah
    -0.50
    erstein
    -0.50
    ophilus
    -0.49
    🏼
    -0.48
    ่วย
    -0.47
    /=
    -0.47
    ysław
    -0.47
    POSITIVE LOGITS
     CreateTagHelper
    1.01
     none
    0.82
     Ours
    0.78
    None
    0.78
    none
    0.77
    +#+
    0.76
     None
    0.75
     NONE
    0.75
    NONE
    0.74
     Mine
    0.72
    Act Density 0.054%

    No Known Activations