INDEX
    Explanations

    verbs or phrases related to things being split or divided

    instances of the word "separated" and its variations

    Negative Logits
    "vous"       -0.71
    "×Ķ"         -0.70
    "WN"         -0.67
    "OD"         -0.67
    "cycl"       -0.65
    "TL"         -0.65
    "nz"         -0.65
    " Briggs"    -0.63
    " Gos"       -0.63
    "enos"       -0.63
    Positive Logits
    " separating"    0.91
    "separ"          0.84
    " separ"         0.83
    " separated"     0.83
    " sexes"         0.81
    " separates"     0.76
    " detach"        0.74
    "ĨĴ"             0.73
    "ively"          0.73
    "icut"           0.73
    Act Density 0.017%

    No Known Activations