INDEX
Explanations
instructions or steps in a cooking recipe
punctuation marks and sentence endings in a recipe or instructional context
New Auto-Interp
Negative Logits
barred
-0.81
categ
-0.81
contradicted
-0.76
hijacked
-0.76
conflicting
-0.75
ridic
-0.73
wiser
-0.72
haunted
-0.72
deserted
-0.72
suicidal
-0.71
POSITIVE LOGITS
Repeat
1.37
Remove
1.35
Slowly
1.29
Pour
1.28
Then
1.27
Transfer
1.26
Place
1.26
Drain
1.25
Divide
1.24
Allow
1.22
Activations Density 0.075%