INDEX
    Explanations

    phrases indicating possession or existence

    New Auto-Interp
    Negative Logits
    essen
    -0.16
     Yourself
    -0.16
     PIC
    -0.14
    unds
    -0.14
    acious
    -0.14
    .lst
    -0.14
    PIC
    -0.14
    imar
    -0.14
     verw
    -0.14
    ayas
    -0.13
    POSITIVE LOGITS
    htag
    0.23
    bara
    0.22
    htable
    0.21
     become
    0.21
     implications
    0.20
    nt
    0.20
     always
    0.19
     potential
    0.19
     consequences
    0.19
     roots
    0.18
    Act Density 0.133%

    No Known Activations