INDEX
Explanations
phrases related to uncertainty or inquiries about a situation
phrases questioning authenticity or certainty
New Auto-Interp
Negative Logits
âĿ
-0.70
letter
-0.69
Nurs
-0.66
irez
-0.63
wagon
-0.62
ulas
-0.60
illac
-0.60
checks
-0.59
bye
-0.59
sectional
-0.58
POSITIVE LOGITS
why
1.10
whether
0.98
WHY
0.87
how
0.86
justify
0.76
why
0.73
whether
0.73
yx
0.71
whereabouts
0.71
wing
0.69
Activations Density 0.049%