INDEX
Explanations
words related to technology and programming
verbs and proper nouns that indicate existence or identity
New Auto-Interp
Negative Logits
Sally
-0.72
touch
-0.71
129
-0.70
essa
-0.68
134
-0.68
onion
-0.67
secut
-0.64
128
-0.63
Reese
-0.62
ont
-0.62
POSITIVE LOGITS
b
1.30
B
1.20
Bs
1.19
bs
1.18
bg
1.15
bh
1.12
bd
1.12
BR
1.12
BN
1.11
BA
1.11
Activations Density 0.332%