INDEX
Explanations
words related to uncertainty or doubt
phrases expressing uncertainty or lack of knowledge
New Auto-Interp
Negative Logits
nevertheless
-0.86
nonetheless
-0.79
indeed
-0.73
ifully
-0.70
moil
-0.69
curiously
-0.68
undoubtedly
-0.67
alternatively
-0.67
vic
-0.66
strangely
-0.64
POSITIVE LOGITS
mattered
0.89
cared
0.82
bothered
0.73
hin
0.73
grasp
0.71
know
0.71
FTWARE
0.70
understood
0.69
cares
0.68
bother
0.68
Activations Density 0.065%