INDEX
Explanations
phrases related to simplicity or plainness
instances of the word "plain"
Negative Logits
otos          -0.88
yip           -0.78
umar          -0.73
glomer        -0.73
obal          -0.71
etheus        -0.70
onz           -0.70
conservancy   -0.70
lasses        -0.69
ept           -0.69
Positive Logits
plain         1.10
text          1.07
sheet         0.91
plain         0.90
\\\\\\\\      0.90
cloth         0.84
rolled        0.83
ified         0.82
sheets        0.82
vanilla       0.80
Activations Density: 0.014%