INDEX
Explanations
the letter "G" at various positions within the text
the presence of specific symbols or characters, particularly the character sequence "<|endoftext|>"
Negative Logits
Token       Logit
ĸļ          -0.70
Mellon      -0.64
reprodu     -0.61
occupied    -0.60
Crowd       -0.59
Wicked      -0.58
Bie         -0.57
Hyde        -0.57
pale        -0.57
Shelby      -0.57
Positive Logits
Token       Logit
roups       1.43
raphic      1.35
reetings    1.26
ossip       1.22
ourmet      1.21
reens       1.20
entle       1.19
rowth       1.18
athering    1.17
uild        1.17
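The positive-logit tokens listed above all read as word endings that would follow a leading "G", which fits the first explanation. The sketch below is only an illustration of that reading, not part of the dashboard: it prepends "G" to each token copied from the list. The helper name `complete_with_prefix` is hypothetical, and the idea that the feature promotes these tokens as completions of "G" is an inference from the listed tokens rather than something the source states.

```python
# Top positive-logit tokens, copied from the list above.
positive_tokens = [
    "roups", "raphic", "reetings", "ossip", "ourmet",
    "reens", "entle", "rowth", "athering", "uild",
]


def complete_with_prefix(tokens, prefix="G"):
    """Prepend `prefix` to each token (hypothetical helper for illustration)."""
    return [prefix + token for token in tokens]


completed = complete_with_prefix(positive_tokens)

# Print each token next to the word it forms after a leading "G".
for token, word in zip(positive_tokens, completed):
    print(f"{token:<10} -> {word}")
```

Each token forms an ordinary English word once "G" is prepended ("Groups", "Graphic", "Greetings", and so on), whereas the negative-logit tokens show no comparable pattern.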
Activations Density: 0.046%