INDEX
Explanations
references to academic institutions and educational contexts
Negative Logits
his  -1.61
his  -1.49
istrinya  -1.21
His  -1.10
seinen  -1.07
HIS  -1.05
seine  -1.02
그의  -1.01
HIS  -1.00
seinem  -0.99
Positive Logits
he  2.61
He  1.72
он  1.66
He  1.56
він  1.37
she  1.19
he  1.17
הוא  1.13
HE  1.07
hee  1.02
Activations Density 0.616%