Explanations
references to ownership or possession
Negative Logits
Monfieur     -0.77
Agamemnon    -0.76
raiſ         -0.74
נטרנט        -0.73
faſt         -0.71
menistan     -0.70
purpoſe      -0.69
Jegyzetek    -0.69
pleaſure     -0.66
Allez        -0.66
Positive Logits
his          2.24
His          2.04
HIS          2.02
His          1.97
his          1.88
HIS          1.80
her          1.54
Her          1.41
他的         1.38
seiner       1.35
Activations Density 0.157%