INDEX
Explanations
references to personal pronouns and their frequency
New Auto-Interp
Negative Logits
roj
-0.16
Äįi
-0.16
uyo
-0.15
roje
-0.15
779
-0.14
rvé
-0.14
ALCHEMY
-0.14
ivid
-0.14
è«ĸ
-0.14
RuntimeObject
-0.14
POSITIVE LOGITS
WH
0.42
wh
0.32
wh
0.32
whe
0.28
hen
0.27
_WH
0.27
“When
0.26
"When
0.26
Wh
0.25
Whe
0.25
Activations Density 0.088%