INDEX
Explanations
occurrences of personal pronouns and their variations
New Auto-Interp
Negative Logits
onn
-0.17
ford
-0.15
kke
-0.15
Indented
-0.15
pong
-0.15
flag
-0.15
_Flag
-0.15
onen
-0.14
upil
-0.14
Shepherd
-0.14
POSITIVE LOGITS
ins
0.19
INS
0.17
ies
0.17
IES
0.17
inspector
0.16
IE
0.16
.accessToken
0.15
hub
0.15
prot
0.15
proto
0.15
Activations Density 0.025%