INDEX
Explanations
character names and familial relationships in narratives
Negative Logits
Token      Logit
cef        -0.19
llu        -0.18
efs        -0.17
nodoc      -0.16
sealed     -0.15
lut        -0.15
UsageId    -0.15
.Sdk       -0.15
anches     -0.15
.gg        -0.15
Positive Logits
Token      Logit
igar       0.16
287        0.15
rial       0.15
screw      0.14
y          0.14
ipped      0.14
pilot      0.14
Blanco     0.14
ennon      0.14
reen       0.14
Activations Density: 0.004%