INDEX
    Explanations

    references to connections between characters in a story

    the conjunction "and" in various contexts, indicating a focus on connection or addition

    New Auto-Interp
    Negative Logits
    ÑĮ
    -0.73
    rued
    -0.73
    anmar
    -0.70
    atars
    -0.70
    Ñı
    -0.69
    Were
    -0.68
    onica
    -0.65
    auga
    -0.65
    igate
    -0.65
    oward
    -0.65
    POSITIVE LOGITS
     prefers
    1.73
     enjoys
    1.69
     understands
    1.61
     knows
    1.60
     wants
    1.60
     believes
    1.58
     spends
    1.56
     loves
    1.55
     intends
    1.54
     thinks
    1.53
    Act Density 0.329%

    No Known Activations