INDEX
Explanations
the word "Me" in sentences
the token representing the speaker or first-person perspective
Negative Logits
paralle     -0.69
totality    -0.67
pole        -0.65
edged       -0.61
rift        -0.60
>>\         -0.59
overnight   -0.59
ordinate    -0.57
medium      -0.56
glim        -0.56
Positive Logits
asuring     1.41
asured      1.19
cca         1.18
zzo         1.16
asures      1.11
asure       1.11
lda         1.10
aning       1.06
yers        1.03
ghan        1.02
Activations Density 0.030%