Explanations
references to self-identity and personal reflection
Reflexive pronouns
reflexive pronouns
Negative Logits
auroit       -0.43
pacchetto    -0.43
грн          -0.40
océan        -0.40
aceptas      -0.40
tuyến        -0.39
corruption   -0.39
بوابة        -0.39
anúncio      -0.39
ladr         -0.38
Positive Logits
herself      0.98
selves       0.93
himself      0.89
Himself      0.88
Myself       0.88
Myself       0.88
Yourself     0.88
Yourself     0.87
myself       0.85
selves       0.83
Activations Density 0.058%
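
As a rough illustration of what a density figure like this usually means, the sketch below computes the percentage of tokens on which a feature is active. It assumes density is defined as the share of tokens with activation above a threshold; the function name, the threshold, and the sample data are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all. The dashboard's exact definition may differ.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 10,000 token activations, 6 of them nonzero.
sample = [0.0] * 9994 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.7]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.058% would mean the feature fires on roughly 6 of every 10,000 tokens, which fits a feature tied to a narrow word class such as reflexive pronouns.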