INDEX
    Explanations

    pronouns, particularly various forms of "we" and "you."

    New Auto-Interp
    Negative Logits
     autorytatywna
    -1.63
     виправивши
    -1.48
     AssemblyCulture
    -1.41
     disambiguazione
    -1.32
    GEBURTSDATUM
    -1.31
    Hentet
    -1.28
    tvguidetime
    -1.28
    Autoritní
    -1.27
     nakalista
    -1.14
    OGND
    -1.13
    POSITIVE LOGITS
    The
    1.16
    We
    1.01
    In
    0.97
    As
    0.96
    For
    0.93
     The
    0.90
    It
    0.89
    An
    0.86
    On
    0.79
    Do
    0.78
    Act Density 0.195%

    No Known Activations