INDEX
Explanations
repetitive second-person pronouns, indicating an emphasis on personal engagement or addressing the reader directly
New Auto-Interp
Head Attr Weights
0:0.07
1:0.04
2:0.04
3:0.05
4:0.03
5:0.13
6:0.03
7:0.02
8:0.25
9:0.06
10:0.11
11:0.10
Negative Logits
rament
-1.94
apter
-1.62
riber
-1.62
snipp
-1.56
Rak
-1.52
Jav
-1.52
Corpus
-1.48
Karn
-1.43
Arabian
-1.34
thumbnails
-1.33
POSITIVE LOGITS
glas
1.84
xit
1.82
EVA
1.66
akeru
1.62
pretend
1.58
hovah
1.54
essage
1.53
icycle
1.52
ptin
1.52
FFFF
1.52
Activations Density 0.089%