INDEX
Explanations
phrases indicating the extent of something, typically using the phrase "all the way to" or "all the way down to"
phrases that indicate a direction or a path
New Auto-Interp
Negative Logits
imble
-0.70
issan
-0.65
anton
-0.62
ĪĴ
-0.62
onics
-0.61
liv
-0.56
aum
-0.56
ervation
-0.56
»Ĵ
-0.56
uci
-0.55
POSITIVE LOGITS
down
0.88
through
0.83
forward
0.78
back
0.75
points
0.73
enthusi
0.72
up
0.72
ÙĴ
0.71
across
0.71
round
0.70
Activations Density 0.015%