INDEX
Explanations
references to characters and their emotions in a narrative
New Auto-Interp
Negative Logits
rai
-0.13
quez
-0.13
tti
-0.13
Eig
-0.13
warn
-0.13
lier
-0.13
áze
-0.13
gonna
-0.13
INET
-0.12
jon
-0.12
POSITIVE LOGITS
-même
0.19
ela
0.17
/her
0.15
Wander
0.15
/she
0.15
zelf
0.15
olvency
0.14
CLE
0.14
mob
0.14
alic
0.13
Activations Density 0.368%