Explanations
names of characters or people
characters and their actions within a narrative context
Negative Logits
Token             Logit
Repeat            -0.79
IED               -0.77
interstitial      -0.77
Depth             -0.72
PLIED             -0.72
NUM               -0.71
orted             -0.69
vered             -0.68
TPPStreamerBot    -0.67
IFIED             -0.67
Positive Logits
Token             Logit
discovers         1.70
learns            1.61
escapes           1.51
convin            1.49
decides           1.48
confronts         1.48
realizes          1.43
wakes             1.38
tries             1.35
finds             1.34
Activation Density: 0.255%
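
The density figure can be read as the share of tokens on which the feature fires. The sketch below illustrates that reading only; it assumes "active" means an activation strictly above a threshold of zero, and the function name `activation_density` and the toy sample are hypothetical, not taken from the page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data: one active token out of 400 (illustrative, not the page's data).
sample = [0.0] * 399 + [2.5]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.255% would mean the feature is active on roughly 1 token in 400.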