INDEX
Explanations
descriptions of physical actions and interactions between characters
emotional responses and physical actions associated with characters in a narrative
New Auto-Interp
Negative Logits
ancial
-0.85
,'"
-0.81
guiName
-0.80
!'"
-0.79
osate
-0.79
TRUMP
-0.77
busters
-0.72
',"
-0.72
ornia
-0.71
'"
-0.71
POSITIVE LOGITS
Jaune
1.20
Pyrrha
1.09
Weasley
1.02
sighed
0.92
Naruto
0.92
nodded
0.91
teasing
0.91
Subaru
0.90
blance
0.89
Machina
0.88
Activations Density 0.643%