INDEX
Explanations
food-related instructions or cooking steps
punctuation and conjunctions indicating sequential steps in a recipe
Negative Logits
estranged      -0.86
indebted       -0.79
unexplained    -0.79
privileged     -0.76
emblem         -0.75
prostitutes    -0.74
criminally     -0.73
affili         -0.73
offended       -0.73
oppressed      -0.73
Positive Logits
Ideally        1.09
Depending      1.09
Alternatively  0.97
Optional       0.95
Required       0.94
Instructions   0.92
Assembly       0.92
Slowly         0.92
Once           0.90
Usually        0.90
Activations Density 0.271%