INDEX
Explanations
phrases related to guiding or leading someone to a destination or experience
references to directions or pathways leading to specific destinations or experiences
New Auto-Interp
Negative Logits
vine
-0.69
icipated
-0.65
stock
-0.61
cised
-0.58
creen
-0.57
ored
-0.55
slips
-0.54
ende
-0.54
uracy
-0.53
extent
-0.53
POSITIVE LOGITS
nowhere
0.92
onwards
0.91
aback
0.86
onward
0.86
toward
0.83
closer
0.80
towards
0.80
into
0.80
Ô
0.77
into
0.76
Activations Density 0.133%