INDEX
    Explanations

    articles and determiners in sentences referencing packages or items

    New Auto-Interp
    Head Attr Weights
    0:0.12
    1:0.03
    2:0.05
    3:0.05
    4:0.03
    5:0.04
    6:0.24
    7:0.03
    8:0.04
    9:0.26
    10:0.03
    11:0.03
    Negative Logits
     Alec
    -4.12
     Kenobi
    -4.10
    Beck
    -4.08
     Lana
    -4.04
     Stro
    -3.96
     Derby
    -3.94
     Weed
    -3.93
     Reeves
    -3.76
     Twain
    -3.76
     Cena
    -3.75
    POSITIVE LOGITS
    packages
    8.41
     Package
    8.25
     packages
    7.92
    Package
    7.75
     package
    7.59
    package
    7.54
    Pack
    7.03
     PACK
    6.50
     Pack
    6.09
    pkg
    5.30
    Act Density 0.002%

    No Known Activations