INDEX
Explanations
words related to temporal events or sequences
New Auto-Interp
Negative Logits
then
-1.06
alors
-0.96
entonces
-0.93
now
-0.85
allora
-0.81
damaligen
-0.81
alors
-0.79
then
-0.78
Then
-0.76
então
-0.75
POSITIVE LOGITS
proceeded
0.99
proceed
0.94
comes
0.84
proceeds
0.82
again
0.80
followed
0.79
proceed
0.79
proceeding
0.75
cometh
0.72
followed
0.72
Activations Density 0.265%