INDEX
Explanations
the word "good" and related words indicating positivity or high quality
phrases or contexts indicating positive quality or reassurance
New Auto-Interp
Negative Logits
hyde
-0.79
eters
-0.75
oths
-0.73
atum
-0.73
pper
-0.70
eds
-0.69
ategory
-0.67
Tsukuyomi
-0.67
opers
-0.66
Reincarnated
-0.65
POSITIVE LOGITS
enough
1.35
luck
1.12
bye
1.05
reads
1.04
ol
1.03
luck
1.03
enough
1.03
Samar
1.02
intentions
1.00
quality
0.89
Activations Density 0.081%