INDEX
Explanations
phrases or words related to giving or receiving directions
references to guidance or navigation
New Auto-Interp
Negative Logits
bon
-0.76
tle
-0.73
IGH
-0.72
Mi
-0.68
ony
-0.68
ighth
-0.66
osphere
-0.65
nia
-0.63
ggies
-0.62
Assembly
-0.62
POSITIVE LOGITS
directions
1.54
Directions
1.03
direction
0.92
instructions
0.82
pread
0.78
thereto
0.75
autions
0.72
srf
0.70
commands
0.69
icular
0.68
Activations Density 0.009%