INDEX
Explanations
verbs indicating actions and experiences, particularly in a personal or historical context
Negative Logits
ſelves
-0.68
يتيمه
-0.58
kpop
-0.56
"}"
-0.55
Mult
-0.54
houſe
-0.54
__))
-0.53
ृत
-0.53
("#{
-0.53
τέλε
-0.53
Positive Logits
himself
1.40
his
1.32
him
1.16
himself
1.07
His
1.05
He
1.05
His
0.98
Himself
0.97
he
0.91
的他
0.91
Activations Density 0.711%