Explanations
references to characters in a narrative
Negative Logits
AddTagHelper    -0.94
تضيفلها         -0.82
Tipp            -0.81
SPS             -0.75
kaldı           -0.73
setOpen         -0.73
publicados      -0.72
yore            -0.71
Thine           -0.70
Duca            -0.69
Positive Logits
characters      1.88
character       1.80
characters      1.63
character       1.56
CHARACTER       1.56
Character       1.53
Characters      1.51
Character       1.42
Characters      1.38
CHARACTERS      1.29
Activations Density 0.060%