INDEX
Explanations
elements related to legal arguments and judicial reasoning
New Auto-Interp
Negative Logits
彼は
-0.91
esso
-0.90
himself
-0.78
his
-0.76
的他
-0.73
彼が
-0.73
he
-0.73
彼の
-0.71
his
-0.70
ihn
-0.68
POSITIVE LOGITS
she
2.48
herself
2.45
her
2.41
그녀
2.20
herself
2.00
její
1.99
彼女の
1.98
hennes
1.98
彼女は
1.96
她的
1.91
Activations Density 2.038%