INDEX
Explanations
prepositions followed by a distance or measurement
phrases indicating proximity or closeness
New Auto-Interp
Negative Logits
ãĥ£
-0.75
cow
-0.66
goodbye
-0.62
ple
-0.62
ãĥ¼ãĤ¯
-0.61
cu
-0.61
ANS
-0.61
de
-0.60
Definition
-0.60
rod
-0.60
POSITIVE LOGITS
ieth
0.87
bounds
0.86
isine
0.83
imore
0.81
ĵĺ
0.80
isode
0.79
parentheses
0.76
emort
0.76
ciating
0.72
orbit
0.72
Activations Density 0.025%