INDEX
Explanations
phrases indicating missed opportunities or alternate scenarios
phrases related to hypothetical situations and outcomes
Negative Logits
Deb        -0.63
Brach      -0.63
thus       -0.61
ftime      -0.59
verend     -0.59
Must       -0.58
most       -0.58
currently  -0.58
tainment   -0.57
ennett     -0.57
Positive Logits
spared     0.84
avoided    0.82
prevented  0.78
born       0.76
wolves     0.75
invented   0.74
saved      0.74
sooner     0.74
worse      0.73
hes        0.73
Activations Density 0.095%