Explanations
mentions of personal achievements or accomplishments
expressions indicating significant achievements or milestones
Negative Logits
specialize   -0.70
anwhile      -0.63
debian       -0.63
roam         -0.60
spor         -0.60
ebted        -0.58
recite       -0.58
pedd         -0.57
ournal       -0.56
Seeking      -0.56
Positive Logits
unbelievable  0.91
unheard       0.87
icing         0.87
ceivable      0.87
surreal       0.86
wow           0.85
ãħĭãħĭ        0.84
Wow           0.84
wow           0.81
Absolutely    0.80
Activations Density: 0.426%