INDEX
Explanations
phrases indicating completion or finality
New Auto-Interp
Negative Logits
ggles
-0.74
hement
-0.71
iatrics
-0.70
iously
-0.69
etheus
-0.69
aires
-0.68
angles
-0.66
onomic
-0.66
kefeller
-0.66
atively
-0.65
POSITIVE LOGITS
nutshell
0.87
goodbye
0.79
understatement
0.78
.ãĢį
0.76
folks
0.75
consolation
0.74
description
0.72
!
0.71
SPONSORED
0.70
alright
0.66
Activations Density 0.166%