INDEX
Explanations
words indicating a progression towards an outcome, result, or decision
phrases that indicate a sequence of events or outcomes
New Auto-Interp
Negative Logits
igger
-0.80
ocene
-0.78
Yard
-0.76
igers
-0.74
adr
-0.73
utenberg
-0.73
raught
-0.73
rieve
-0.72
unity
-0.72
ir
-0.69
POSITIVE LOGITS
thereafter
0.93
importantly
0.87
consequently
0.85
rely
0.77
succeeded
0.77
alike
0.75
replaced
0.72
proceeded
0.72
replaces
0.72
anticipate
0.69
Activations Density 0.161%