Explanations
different languages
Past tense verbs
Negative Logits
would        -0.85
would        -0.81
will         -0.75
Would        -0.71
оригіналу    -0.71
fhould       -0.71
WOULD        -0.66
ShouldBe     -0.66
can          -0.65
exitRule     -0.65
Positive Logits
was          0.95
did          0.94
came         0.91
took         0.90
went         0.87
gave         0.84
began        0.81
ended        0.79
Did          0.77
did          0.76
Activations Density 0.499%