INDEX
Explanations
references to characters and their relationships in narratives
New Auto-Interp
Negative Logits
seamnă
-0.72
Arund
-0.59
række
-0.54
thora
-0.54
setBorder
-0.53
vektör
-0.53
Anu
-0.51
internazionali
-0.51
Nant
-0.49
Tolu
-0.49
POSITIVE LOGITS
himself
1.88
his
1.75
himself
1.66
his
1.46
he
1.42
him
1.41
Himself
1.31
His
1.23
彼は
1.20
seinem
1.19
Activations Density 0.266%