INDEX
Explanations
phrases related to explaining or justifying something
phrases related to justification and explanation
New Auto-Interp
Negative Logits
boot
-0.72
largeDownload
-0.66
Carbuncle
-0.63
usa
-0.62
sshd
-0.61
aird
-0.60
headed
-0.60
cffffcc
-0.60
paced
-0.58
avery
-0.58
POSITIVE LOGITS
oneself
0.92
aloud
0.91
anything
0.88
loudly
0.87
yourself
0.84
Yourself
0.82
publicly
0.81
something
0.79
anything
0.78
truths
0.77
Activations Density 0.398%