INDEX
Explanations
purely procedural steps or instructions
cooking-related actions and instructions
New Auto-Interp
Negative Logits
answers
-0.85
categ
-0.73
fielded
-0.68
situational
-0.65
royalty
-0.65
existed
-0.65
unanswered
-0.65
credential
-0.64
indebted
-0.64
treasury
-0.64
POSITIVE LOGITS
Remove
1.16
Pour
1.13
Slowly
1.10
Remove
1.05
Then
1.02
Drain
1.01
Stir
1.00
Fold
0.97
Bake
0.97
Divide
0.95
Activations Density 0.097%