INDEX
Explanations
references to quantities and counts, particularly using terms related to one or more items
"une" or "un" followed by another word
quantifiers and indefinite articles
New Auto-Interp
Negative Logits
myſelf
-0.83
OFDb
-0.82
Jefus
-0.80
himſelf
-0.80
itſelf
-0.80
Majefty
-0.79
greateſt
-0.78
ſche
-0.77
raiſ
-0.76
becauſe
-0.75
POSITIVE LOGITS
certain
0.83
very
0.80
cuantos
0.74
considerable
0.73
large
0.72
kind
0.71
few
0.70
good
0.69
fairly
0.67
rather
0.67
Activations Density 0.016%