INDEX
Explanations
dates or chronological events
phrases that indicate temporal sequences or events
New Auto-Interp
Negative Logits
ranged
-0.88
continue
-0.72
atom
-0.67
objects
-0.67
incre
-0.66
wart
-0.66
imentary
-0.66
options
-0.64
iary
-0.64
cel
-0.64
POSITIVE LOGITS
dusk
0.91
ironic
0.88
bitters
0.86
dawn
0.83
htaking
0.80
raining
0.77
twilight
0.77
surreal
0.77
thrilling
0.77
TIME
0.76
Activations Density 0.187%