INDEX
Explanations
references to breaks or pauses in activities or careers
New Auto-Interp
Negative Logits
hurry
-0.14
URRE
-0.13
erton
-0.13
overlooked
-0.13
weep
-0.13
dawn
-0.13
ULAR
-0.13
outf
-0.13
strip
-0.13
hast
-0.13
POSITIVE LOGITS
break
0.56
breaks
0.48
pause
0.45
hiatus
0.45
Break
0.41
-break
0.40
break
0.40
ä¼ij
0.40
Pause
0.39
pause
0.38
Activations Density 0.254%