INDEX
Explanations
directional cues or instructions
phrases related to orientation and positioning
Negative Logits
Flavoring   -0.74
ontent      -0.74
Paste       -0.72
ufact       -0.71
ineries     -0.68
Mich        -0.66
esters      -0.66
Taste       -0.65
Soup        -0.65
Ore         -0.65
Positive Logits
downwards       1.60
sideways        1.55
perpendicular   1.52
downward        1.45
backwards       1.42
upward          1.36
backward        1.34
upwards         1.31
perpend         1.30
forwards        1.27
Activations Density 0.375%