INDEX
Explanations
quantitative terms related to amounts or quantities
phrases emphasizing the concept of quantity or magnitude
New Auto-Interp
Negative Logits
ilities
-0.74
imens
-0.68
Pic
-0.67
missions
-0.67
umbn
-0.66
ivals
-0.66
ours
-0.65
ubs
-0.64
otos
-0.64
atches
-0.63
POSITIVE LOGITS
anymore
0.78
bang
0.71
efeated
0.70
!!!!
0.69
!?
0.65
!!!!!
0.62
Archdemon
0.62
sword
0.61
Kappa
0.61
!?"
0.59
Activations Density 0.062%