INDEX
    Explanations

    the word "The" at the beginning of sentences

    New Auto-Interp
    Negative Logits
     wikipagina
    -1.21
    -0.94
     виправивши
    -0.89
    Revenir
    -0.88
    +#+#
    -0.87
    Tikang
    -0.86
    routeProvider
    -0.84
    mybatisplus
    -0.82
    expandindo
    -0.80
     pinulongan
    -0.79
    POSITIVE LOGITS
     The
    1.35
    The
    1.34
     A
    0.80
     An
    0.71
    This
    0.69
    An
    0.68
     In
    0.67
    In
    0.66
     This
    0.65
    These
    0.65
    Act Density 0.368%

    No Known Activations