INDEX
Explanations
times of day in a specific format (e.g., 9 a.m.)
occurrences of the letter 'a' in various contexts
New Auto-Interp
Negative Logits
ank
-0.58
ahime
-0.55
wrists
-0.54
unavoid
-0.53
Insurance
-0.53
newcom
-0.52
eyes
-0.52
forwards
-0.52
acknowled
-0.52
analogue
-0.51
POSITIVE LOGITS
.,
0.88
.;
0.79
.,"
0.79
.?
0.76
.),
0.75
clock
0.71
pm
0.68
._
0.67
versions
0.66
./
0.65
Activations Density 0.013%