INDEX
    Explanations

    phrases related to command or authority

    references to control or authority within a context

    New Auto-Interp
    Negative Logits
     TOUR
    -0.72
    INGTON
    -0.68
     PET
    -0.67
     execut
    -0.67
     voic
    -0.65
     Exile
    -0.65
     iP
    -0.64
     chars
    -0.62
     Atomic
    -0.62
     Bund
    -0.62
    POSITIVE LOGITS
    acea
    0.82
    ibaba
    0.81
    ndra
    0.81
    osate
    0.75
    prus
    0.74
    oglu
    0.73
    ossier
    0.73
    lda
    0.71
    aund
    0.69
    hai
    0.69
    Act Density 0.000%

    No Known Activations