INDEX
Explanations
the letter 'a'
instances of the article "a."
New Auto-Interp
Negative Logits
appointments
-0.75
Emerson
-0.71
Angus
-0.68
Osh
-0.68
Oscar
-0.66
Allied
-0.66
opposite
-0.63
Oval
-0.63
objectively
-0.62
every
-0.61
POSITIVE LOGITS
lex
0.97
ria
0.96
vec
0.95
uras
0.85
ird
0.83
hhh
0.81
aaaa
0.81
guest
0.81
cess
0.81
rived
0.80
Activations Density 0.050%