INDEX
Explanations
phrases indicating a return or repetition
Negative Logits
ubl         -0.15
restart     -0.14
Renew       -0.14
Rehab       -0.14
Wildcard    -0.14
-forward    -0.14
restart     -0.14
ucid        -0.14
rehabilit   -0.14
urette      -0.13
Positive Logits
back        0.29
return      0.28
terug       0.28
returned    0.27
返回        0.26
returning   0.25
returns     0.24
return      0.24
Return      0.23
回          0.23
Activations Density 0.099%