INDEX
Explanations
phrases related to ethics and morality
punctuation marks, specifically periods
Negative Logits
paran           -0.63
pockets         -0.59
cradle          -0.58
accelerating    -0.57
notebooks       -0.57
collapsed       -0.56
misconceptions  -0.56
patriarch       -0.55
acle            -0.55
supplements     -0.54
Positive Logits
actionDate                        0.90
."                                0.87
-.                                0.80
\-                                0.79
mil                               0.75
certain                           0.73
Ku                                0.71
sit                               0.69
until                             0.69
################################  0.69
Activations Density: 0.018%