INDEX
    Explanations

    phrases that initiate suggestions or directives

    Negative Logits
    "ught"       -0.17
    "ulses"      -0.16
    ".AspNet"    -0.15
    "ulton"      -0.15
    "forest"     -0.14
    "kses"       -0.14
    "stown"      -0.14
    "ogue"       -0.14
    "ookies"     -0.14
    " bergen"    -0.14
    Positive Logits
    "ROC"        0.15
    "eds"        0.15
    ".Xaml"      0.14
    "ãĤĤãģĨ"     0.14
    "_plural"    0.14
    "ene"        0.14
    "obl"        0.14
    " Trident"   0.14
    "enger"      0.13
    " Hubb"      0.13
    Act Density 0.040%

    No Known Activations