INDEX
Explanations
instances of the phrase "I am" and variations of self-reference
New Auto-Interp
Negative Logits
968
-0.18
/goto
-0.15
_mE
-0.15
ردÙĩ
-0.14
вел
-0.14
scram
-0.14
.jd
-0.14
961
-0.14
287
-0.14
_mB
-0.13
POSITIVE LOGITS
encounter
0.44
encountering
0.39
encounters
0.35
Encounter
0.35
éģĩ
0.35
encountered
0.34
facing
0.33
faced
0.32
face
0.30
experience
0.29
Activations Density 0.088%