Explanations
the presence of the pronoun "I" to identify personal involvement or perspective
Negative Logits
Token       Logit
asca        -0.17
ihan        -0.15
enberg      -0.15
isz         -0.14
ffer        -0.14
eting       -0.14
eed         -0.13
олуч        -0.13
othermal    -0.13
ope         -0.13
Positive Logits
Token             Logit
stated            0.22
mentioned         0.21
mentioned         0.21
.scalablytyped    0.20
age               0.19
progresses        0.18
ages              0.17
progress          0.17
said              0.17
Mention           0.17
Activations Density 0.036%