INDEX
Explanations
phrases related to confidence, conviction, and strong beliefs
New Auto-Interp
Negative Logits
atche
-0.74
imens
-0.71
ivals
-0.68
ilities
-0.66
pmwiki
-0.65
umbn
-0.65
Pic
-0.64
onde
-0.63
TAIN
-0.62
missions
-0.62
POSITIVE LOGITS
!!!!
0.71
bang
0.70
!!!!!
0.67
anymore
0.67
efeated
0.65
regardless
0.65
mattered
0.64
!?
0.64
â̦â̦
0.63
onwards
0.61
Activations Density 0.054%