INDEX
    Explanations

    words related to stopping or halting actions, particularly in the context of traffic signs or commands

    "stop" or its variations

    Negative Logits
    " EconPapers"       -0.83
    "ItemBackground"    -0.80
    ":✨"                -0.79
    " HasFactory"       -0.76
    "plier"             -0.75
    " الحره"            -0.72
    "DockStyle"         -0.71
    " linkovi"          -0.70
    "]))\n"             -0.69
    " שוליים"           -0.69
    Positive Logits
    " Stops"    0.96
    " STOP"     0.94
    " stops"    0.93
    " Stop"     0.88
    "STOP"      0.88
    " stop"     0.85
    "stops"     0.85
    "Stop"      0.83
    "stop"      0.80
    "Stops"     0.79
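
    The positive-logit tokens above appear to differ only in casing, a leading space, and a plural "s". A minimal Python sketch of that check follows, assuming the token strings and values are transcribed exactly as listed; the `normalize` helper and the `positive_logits` name are hypothetical and not part of any dashboard API.

```python
from collections import defaultdict

# Token/value pairs copied from the positive-logit list above.
# A leading space is part of the token, so it is kept in the string.
positive_logits = [
    (" Stops", 0.96), (" STOP", 0.94), (" stops", 0.93), (" Stop", 0.88),
    ("STOP", 0.88), (" stop", 0.85), ("stops", 0.85), ("Stop", 0.83),
    ("stop", 0.80), ("Stops", 0.79),
]


def normalize(token: str) -> str:
    """Hypothetical helper: drop surrounding whitespace and casing."""
    return token.strip().lower()


# Group logit values by the normalized word form.
groups = defaultdict(list)
for token, value in positive_logits:
    groups[normalize(token)].append(value)

# Print each word form with its variant count and mean logit value.
for word, values in sorted(groups.items()):
    print(word, len(values), round(sum(values) / len(values), 3))
```

    Under these assumptions all ten tokens collapse to the two word forms "stop" and "stops", which is consistent with the second explanation listed above.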
    Act Density 0.074%

    No Known Activations