INDEX
Explanations
texts discussing the directionality of time, focusing on concepts related to moving backwards or forwards in time
New Auto-Interp
Negative Logits
ateurs
-0.82
raltar
-0.81
rament
-0.81
oleon
-0.77
abase
-0.74
anooga
-0.74
lez
-0.74
riz
-0.73
rol
-0.72
atum
-0.72
POSITIVE LOGITS
wards
0.92
stairs
0.87
ward
0.87
compatibility
0.84
compat
0.78
spiral
0.78
WARD
0.77
step
0.74
side
0.72
reflection
0.66
Activations Density 5.090%