INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     VIDEOS
    -0.78
    athi
    -0.67
    Bas
    -0.66
    MENTS
    -0.65
    BOX
    -0.64
    nor
    -0.63
    STE
    -0.61
     Waiting
    -0.60
     Editors
    -0.59
    MENT
    -0.58
    POSITIVE LOGITS
     translates
    1.10
     consists
    1.00
     originated
    0.98
     comprises
    0.96
     specializes
    0.95
     incidentally
    0.95
     represents
    0.93
     resulted
    0.92
     consisted
    0.92
     operates
    0.91
    Act Density 0.097%

    No Known Activations