INDEX
Explanations
transitional phrases indicating time or sequence of events
Negative Logits
Antar    -0.16
aq       -0.16
chal     -0.15
azole    -0.15
ubits    -0.14
uyo      -0.14
tons     -0.14
istas    -0.14
Grade    -0.13
bru      -0.13
Positive Logits
hin             0.17
SENT            0.15
sadd            0.14
Buen            0.14
Eigen           0.14
estring         0.14
.databinding    0.14
hrad            0.14
Smooth          0.14
pacing          0.14
Activations Density: 0.154%