INDEX
    Explanations
    Negative Logits
    " wearing"     -1.29
    " Wearing"     -1.11
    " holding"     -1.05
    " emitting"    -1.01
    "wearing"      -0.98
    "Wearing"      -0.96
    " carrying"    -0.93
    " emitted"     -0.76
    " wielding"    -0.75
    " emit"        -0.71
    Positive Logits
    " CreateTagHelper"    0.83
    " kaarangay"          0.66
    "adpleegd"            0.65
    "DockStyle"           0.65
    " autorytatywna"      0.61
    "antiate"             0.60
    " CURIAM"             0.59
    ")*/"                 0.59
    "[])"                 0.59
    "SequentialGroup"     0.59
    Activation Density: 0.027%

    No Known Activations