INDEX
Explanations
phrases or expressions indicating movement or transitions
New Auto-Interp
Negative Logits
appa
-0.15
attern
-0.15
discharge
-0.15
Chu
-0.15
905
-0.14
-os
-0.14
ablish
-0.14
áy
-0.14
769
-0.14
formation
-0.14
POSITIVE LOGITS
icari
0.16
zac
0.16
sla
0.15
.configure
0.15
adel
0.15
alic
0.14
ansson
0.14
oker
0.14
iode
0.14
yen
0.13
Activations Density 0.268%