INDEX
Explanations
phrases related to personal development and self-improvement
notions related to success and failure in personal and professional development
New Auto-Interp
Negative Logits
resumed
-0.91
Recall
-0.78
reopened
-0.75
evacuation
-0.75
fugitive
-0.74
alarmed
-0.71
evacuated
-0.69
recall
-0.67
repatri
-0.66
recalled
-0.65
POSITIVE LOGITS
shitty
1.05
fucking
0.95
shit
0.94
mediocre
0.93
ocre
0.92
FUCK
0.89
EVERY
0.89
goddamn
0.88
shit
0.87
bullshit
0.86
Activations Density 1.137%