INDEX
Explanations
phrases indicating completion or closure
phrases indicating closure or conclusion
New Auto-Interp
Negative Logits
ums
-0.75
vere
-0.69
itudes
-0.67
ourge
-0.67
uld
-0.66
outh
-0.64
nuts
-0.63
masses
-0.63
Gree
-0.62
bour
-0.62
POSITIVE LOGITS
ãĥ¼ãĥĨ
0.86
âĶĢâĶĢ
0.85
srfAttach
0.79
actionDate
0.77
REF
0.74
Ñĭ
0.71
curfew
0.71
jad
0.71
halt
0.71
halfway
0.70
Activations Density 0.034%