INDEX
    Explanations

    phrases that suggest starting or initiating something significant

    NEGATIVE LOGITS
    Token              Logit
    "ocale"            -0.16
    "reste"            -0.16
    "SystemService"    -0.15
    "inha"             -0.15
    "jít"              -0.15
    "ër"               -0.15
    "oose"             -0.14
    "姓"               -0.14
    "porto"            -0.14
    "ctors"            -0.14
    POSITIVE LOGITS
    Token              Logit
    " bang"            0.29
    " Bang"            0.22
    "bang"             0.18
    " basics"          0.18
    " followed"        0.18
    "Bang"             0.17
    " humble"          0.17
    " premise"         0.17
    "est"              0.17
    " foundation"      0.16
    Activation Density: 0.079%

    No Known Activations