INDEX
Explanations
phrases indicating progress or development over time
New Auto-Interp
Negative Logits
ovah
-0.16
olid
-0.15
timeofday
-0.15
OVE
-0.15
.nano
-0.14
dims
-0.14
ÑĶм
-0.14
ç§
-0.14
hec
-0.14
isku
-0.14
POSITIVE LOGITS
miles
0.28
Miles
0.27
distance
0.26
mile
0.24
distances
0.23
mile
0.22
Distance
0.22
-mile
0.21
distance
0.21
long
0.21
Activations Density 0.019%