INDEX
    Explanations

    personal pronouns, especially in conversations

    New Auto-Interp
    Negative Logits
     is
    -0.86
     has
    -0.76
     itself
    -0.60
     exists
    -0.57
     does
    -0.55
     was
    -0.55
     seems
    -0.53
     betreft
    -0.52
     provides
    -0.51
    ıdır
    -0.49
    POSITIVE LOGITS
    parsedMessage
    1.05
     said
    0.98
    sizeCache
    0.94
    said
    0.93
    ]--;
    0.91
    脚注の使い方
    0.88
    Said
    0.84
    istoitu
    0.82
     Said
    0.79
    brainly
    0.79
    Act Density 0.015%

    No Known Activations