INDEX
Explanations
titles or headings in articles
New Auto-Interp
Negative Logits
chrom
-0.63
crou
-0.61
aples
-0.60
bases
-0.60
DERR
-0.60
EEK
-0.60
ometers
-0.60
bos
-0.59
base
-0.58
tripod
-0.58
POSITIVE LOGITS
"#
1.00
Dreams
0.83
Beware
0.81
"<
0.80
selves
0.79
titled
0.76
Dear
0.74
Fahrenheit
0.73
Nemesis
0.73
Endless
0.72
Activations Density 0.041%