INDEX
Explanations
specific formatting or markers in the text structure
Followed by words starting with lowercase letters
CALIBRATION, vector, parents
New Auto-Interp
Negative Logits
↵↵
-1.09
R
-0.86
O
-0.84
M
-0.82
I
-0.82
T
-0.81
E
-0.81
L
-0.80
B
-0.80
H
-0.79
POSITIVE LOGITS
itſelf
1.70
myſelf
1.64
pleaſure
1.49
Monfieur
1.42
་་
1.41
ſeveral
1.38
doubtnut
1.37
greateſt
1.37
themſelves
1.34
ſmall
1.34
Activations Density 0.215%