INDEX
    Explanations

    phrases related to preparation and encouragement to engage in activities

    New Auto-Interp
    Negative Logits
    oling
    -0.16
    plash
    -0.15
     Progress
    -0.14
    loth
    -0.14
    oggle
    -0.14
     Leader
    -0.14
     Literature
    -0.14
    obao
    -0.14
    ellas
    -0.14
    intColor
    -0.14
    POSITIVE LOGITS
    766
    0.15
     fran
    0.14
    931
    0.14
    ikk
    0.14
    çŃĭ
    0.14
    ewan
    0.14
    .compose
    0.14
    kk
    0.14
    722
    0.14
     offense
    0.13
    Act Density 0.222%

    No Known Activations