INDEX
    Explanations

    terms and phrases related to categorical definitions and classifications

    Head Attr Weights
    0:0.03
    1:0.02
    2:0.04
    3:0.05
    4:0.09
    5:0.03
    6:0.06
    7:0.40
    8:0.03
    9:0.03
    10:0.10
    11:0.07
    Negative Logits
    " webcam": -1.52
    " lining": -1.35
    " swayed": -1.32
    " breeze": -1.29
    " Provided": -1.28
    " Printed": -1.28
    " withstand": -1.23
    " Joh": -1.20
    " hopping": -1.20
    "..........": -1.20
    Positive Logits
    "raph": 1.56
    "grave": 1.50
    " coined": 1.50
    "Accessory": 1.50
    " gener": 1.45
    "tera": 1.44
    "amphetamine": 1.41
    "non": 1.41
    "hani": 1.38
    "opa": 1.38
    Act Density 0.013%

    No Known Activations