INDEX
Explanations
time references in a specific format - hours followed by 'a.m.' or 'p.m.'
occurrences of the letter 'm'
New Auto-Interp
Negative Logits
theless
-0.64
DragonMagazine
-0.58
acknowled
-0.54
blink
-0.53
behav
-0.53
apartheid
-0.52
conson
-0.51
literature
-0.51
disob
-0.50
spare
-0.50
POSITIVE LOGITS
.,
1.80
.;
1.62
.?
1.54
.:
1.49
.,"
1.45
./
1.34
.–
1.27
.—
1.25
.-
1.23
.),
1.21
Activations Density 0.020%