INDEX
Explanations
articles and determiners in sentences referencing packages or items
Head Attr Weights
0:0.12
1:0.03
2:0.05
3:0.05
4:0.03
5:0.04
6:0.24
7:0.03
8:0.04
9:0.26
10:0.03
11:0.03
Negative Logits
Alec      -4.12
Kenobi    -4.10
Beck      -4.08
Lana      -4.04
Stro      -3.96
Derby     -3.94
Weed      -3.93
Reeves    -3.76
Twain     -3.76
Cena      -3.75
Positive Logits
packages  8.41
Package   8.25
packages  7.92
Package   7.75
package   7.59
package   7.54
Pack      7.03
PACK      6.50
Pack      6.09
pkg       5.30
Activations Density 0.002%