INDEX
Explanations
phrases that indicate time duration or periods
New Auto-Interp
Negative Logits
rary
-0.18
ERRU
-0.15
776
-0.14
aspect
-0.14
inesis
-0.13
uci
-0.13
åľ°çĤ¹
-0.13
possibilit
-0.13
unlikely
-0.13
.addAction
-0.13
POSITIVE LOGITS
initial
0.31
successful
0.30
initially
0.26
failed
0.25
unsuccessful
0.24
Successful
0.24
Initial
0.23
Initial
0.23
initial
0.23
successfully
0.23
Activations Density 0.153%