INDEX
    Explanations

    repetitive second-person pronouns, indicating an emphasis on personal engagement or addressing the reader directly

    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.04
    2:0.04
    3:0.05
    4:0.03
    5:0.13
    6:0.03
    7:0.02
    8:0.25
    9:0.06
    10:0.11
    11:0.10
    Negative Logits
    rament
    -1.94
    apter
    -1.62
    riber
    -1.62
     snipp
    -1.56
     Rak
    -1.52
     Jav
    -1.52
     Corpus
    -1.48
     Karn
    -1.43
     Arabian
    -1.34
    thumbnails
    -1.33
    POSITIVE LOGITS
    glas
    1.84
    xit
    1.82
    EVA
    1.66
    akeru
    1.62
     pretend
    1.58
    hovah
    1.54
    essage
    1.53
    icycle
    1.52
    ptin
    1.52
    FFFF
    1.52
    Act Density 0.089%

    No Known Activations