INDEX
Explanations
phrases indicating temporal or chronological relationships
Negative Logits
MAP        -0.15
loy        -0.14
ourg       -0.14
iod        -0.14
Produ      -0.13
edition    -0.13
ий         -0.13
Loy        -0.13
ikan       -0.13
asta       -0.13
Positive Logits
arrival    0.20
arrive     0.20
arrived    0.20
joining    0.18
прибы      0.16
Powers     0.16
arriving   0.16
arriv      0.16
avez       0.15
lleg       0.15
Activations Density 0.136%