INDEX
    Explanations

    urgent calls to action or important information

    phrases that indicate must-watch content or urgent recommendations

    Negative Logits
    " Entered"       -0.74
    "bill"           -0.51
    " arche"         -0.51
    " Wilderness"    -0.50
    " Burton"        -0.50
    " Letter"        -0.49
    " Haz"           -0.48
    (whitespace)     -0.48
    "wcs"            -0.48
    " Construct"     -0.48
    Positive Logits
    " rgb"           0.73
    " itself"        0.67
    "ohm"            0.66
    "ynasty"         0.64
    "yssey"          0.61
    "yip"            0.61
    "ļé"             0.61
    "rylic"          0.58
    "eport"          0.58
    "sson"           0.58
    Activation Density: 1.837%

    No Known Activations