INDEX
Explanations
comparative phrases indicating time or preference
New Auto-Interp
Negative Logits
oren
-0.79
xual
-0.78
ĸļ
-0.74
VIDEOS
-0.72
ebin
-0.69
ivid
-0.69
uay
-0.65
uran
-0.64
subdivision
-0.63
ossier
-0.62
POSITIVE LOGITS
Else
0.87
chance
0.79
worse
0.77
Faster
0.76
farther
0.75
trib
0.72
than
0.72
2024
0.70
Better
0.69
colder
0.68
Activations Density 0.028%