INDEX
Explanations
phrases expressing certainty or high probability
emphasizing adverbs that convey certainty or intensity
Negative Logits
Token               Logit
iling               -0.67
archment            -0.63
dding               -0.62
attempting          -0.62
oller               -0.62
guiActiveUnfocused  -0.61
Calling             -0.61
Warning             -0.61
senal               -0.60
Reporting           -0.60
Positive Logits
Token               Logit
happened            1.22
happens             1.13
hurts               1.07
transpired          1.06
feels               1.05
seems               1.04
occurred            1.01
boils               1.00
seemed              0.98
took                0.98
Activations Density 0.118%
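
The page does not say how the density figure is computed. A common reading, assumed here, is that it is the percentage of sampled tokens on which the feature's activation is nonzero. The sketch below shows that arithmetic only; the function name `activation_density` and the sample counts are hypothetical and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float]) -> float:
    """Return the percentage of tokens with a nonzero activation.

    Assumption: "density" means (active tokens / total tokens) * 100.
    The source page does not confirm this definition.
    """
    total = len(activations)
    if total == 0:
        raise ValueError("need at least one activation value")
    # Count tokens where the feature fired at all.
    active = sum(1 for a in activations if a > 0.0)
    return 100.0 * active / total


# Hypothetical sample: 118 active tokens out of 100,000 sampled tokens,
# chosen only so the result lands near the figure reported above.
sample = [1.0] * 118 + [0.0] * (100_000 - 118)
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.118% means the feature fires on roughly one token in 850, which is consistent with a sparse feature tied to a narrow class of contexts.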