INDEX
Explanations
recipes and cooking instructions
cooking instructions and processes
New Auto-Interp
Negative Logits
Samar
-0.71
ancestor
-0.71
outdated
-0.69
emblem
-0.69
shif
-0.69
affiliated
-0.69
implying
-0.69
denying
-0.66
terday
-0.66
treasury
-0.66
POSITIVE LOGITS
Slowly
1.04
Then
0.94
Then
0.93
Remove
0.92
Repeat
0.91
Once
0.89
Repeat
0.87
Instructions
0.87
Make
0.87
Recipe
0.87
Activations Density 0.149%