INDEX
    Explanations

    expressions indicating singular entities or concepts

    "One" followed by a temporal unit

    “one” followed by a noun

    New Auto-Interp
    Negative Logits
    rungsseite
    -0.72
    AndEndTag
    -0.61
    adaptiveStyles
    -0.59
    EndInit
    -0.58
     pleaſure
    -0.56
    endpush
    -0.56
    ArrowToggle
    -0.55
     ſu
    -0.55
     Meksiku
    -0.55
     ſche
    -0.54
    POSITIVE LOGITS
     كومونز
    0.71
     single
    0.63
    kuuta
    0.59
     liners
    0.58
    theless
    0.57
    }{*}{
    0.55
    single
    0.54
     sürü
    0.52
     liner
    0.51
     Single
    0.51
    Act Density 0.638%

    No Known Activations