Explanations
prepositions and related terms indicating direction or orientation
direction toward a target
Negative Logits
哔          -0.40
ektiv       -0.38
kew         -0.38
ricanes     -0.37
SEGUIR      -0.37
frankly     -0.36
stumped     -0.36
WebVitals   -0.36
resell      -0.36
taj         -0.36
Positive Logits
Toward      1.48
Towards     1.45
toward      1.44
towards     1.44
Toward      1.42
Towards     1.41
towards     1.39
toward      1.39
hacia       1.01
hacia       0.94
Activations Density: 0.028%