INDEX
    Explanations

    key comparisons and contrasting elements in narratives

    Negative Logits
    atti        -0.16
    aby         -0.16
     Hob        -0.15
    709         -0.14
    loc         -0.14
    ica         -0.13
    ellig       -0.13
    -piece      -0.13
     Bristol    -0.13
     pieces     -0.13

    Positive Logits
     Libert     0.16
    itore       0.15
    vore        0.15
    udes        0.15
    íĮĮ         0.14
    Opens       0.14
    achat       0.14
     backpage   0.14
    ouser       0.14
    å±±å¸Ĥ      0.14
    Act Density 0.296%

    No Known Activations