INDEX
Explanations
mentions and discussions about characters in narratives
New Auto-Interp
Negative Logits
AddTagHelper
-1.04
Tipp
-0.76
Conducted
-0.72
Uribe
-0.69
SPS
-0.66
تضيفلها
-0.65
publicados
-0.65
kaldı
-0.65
Latch
-0.62
揄
-0.62
POSITIVE LOGITS
characters
1.59
character
1.50
characters
1.39
character
1.32
Characters
1.29
Character
1.27
CHARACTER
1.26
Characters
1.17
Character
1.14
CHARACTERS
1.06
Activations Density 0.038%