INDEX
Explanations
numeric identifiers within strings
instances of specific keywords and identifiers
New Auto-Interp
Negative Logits
Sally
-0.83
Ler
-0.77
Sz
-0.76
Stew
-0.75
Ens
-0.72
Elise
-0.71
Sigma
-0.70
Sak
-0.70
Isle
-0.69
Ell
-0.69
POSITIVE LOGITS
b
1.47
bs
1.39
bis
1.29
ber
1.28
bb
1.27
bol
1.26
ba
1.25
bish
1.23
bral
1.20
bi
1.19
Activations Density 0.222%