INDEX
Explanations
unexpected outcomes or events in a narrative
New Auto-Interp
Negative Logits
onics
-0.81
anti
-0.72
mob
-0.66
ore
-0.66
onomy
-0.66
aim
-0.65
orean
-0.65
ove
-0.64
oug
-0.64
ismo
-0.62
POSITIVE LOGITS
refill
0.78
reapp
0.74
beg
0.73
realize
0.73
realise
0.72
LESS
0.72
abruptly
0.71
remind
0.70
ersen
0.70
subsequently
0.69
Activations Density 9.797%