Explanations
phrases indicative of interpersonal relationships and dialogue
pronouns after punctuation
Negative Logits
которое    -0.60
яке        -0.53
urtstag    -0.51
trone      -0.50
YOND       -0.49
hivyo      -0.49
PONENTS    -0.49
いくつか    -0.49
thừa       -0.48
것은        -0.47
Positive Logits
who        1.02
whom       0.99
此人        0.95
他是        0.93
whom       0.91
who        0.90
he         0.89
she        0.88
shes       0.87
him        0.84
Activations Density: 0.356%