INDEX
Explanations
phrases or statements expressing uncertainty
phrases that express uncertainty or questioning of certainty
New Auto-Interp
Negative Logits
Bloomberg
-0.60
yip
-0.56
portation
-0.53
Dangerous
-0.53
Diary
-0.51
uler
-0.51
ticket
-0.50
Land
-0.50
KK
-0.50
issance
-0.50
POSITIVE LOGITS
sure
1.21
yourselves
1.03
yourself
0.95
instance
0.95
certain
0.93
oneself
0.92
ourselves
0.88
Yourself
0.86
see
0.86
example
0.85
Activations Density 0.091%