INDEX
    Explanations

    pronouns indicating personal experience or identity

    New Auto-Interp
    Negative Logits
    <bos>
    -0.82
     kasarigan
    -0.74
    IndentedString
    -0.67
    NameInMap
    -0.62
     noix
    -0.59
    ainville
    -0.56
     мәкал
    -0.56
    WithIOException
    -0.55
     gepubliceerd
    -0.53
     AspNetCore
    -0.52
    POSITIVE LOGITS
    Encyclopædia
    0.71
    layoutControl
    0.69
    LEncoder
    0.69
     ever
    0.68
    }\]
    0.68
    "):
    
    0.67
     TextInputType
    0.67
     المعل
    0.65
    ercises
    0.65
     they
    0.65
    Act Density 0.378%

    No Known Activations