    Negative Logits
    " Kul"       -0.07
    " خواه"      -0.07
    "Ol"         -0.07
    " SVM"       -0.07
    " Silver"    -0.06
    " purple"    -0.06
    " Україн"    -0.06
    " Wein"      -0.06
    " real"      -0.06
    "ível"       -0.06
    Positive Logits
    " dispatch"      0.15
    "Dispatcher"     0.13
    " dispatcher"    0.13
    "dispatch"       0.12
    " Dispatch"      0.12
    " dispatched"    0.12
    "\tdispatch"     0.12
    "Dispatch"       0.11
    " DISPATCH"      0.10
    "_dispatch"      0.10
    Activation Density: 0.003%

    No Known Activations