INDEX
    Explanations

    words related to personal preferences and interactions

    New Auto-Interp
    Negative Logits
    ilion
    -0.81
    arta
    -0.81
    aunder
    -0.78
    ourse
    -0.69
    ocument
    -0.69
    DragonMagazine
    -0.67
    vernment
    -0.66
    ItemImage
    -0.66
    emin
    -0.65
    WF
    -0.64
    POSITIVE LOGITS
     seeing
    0.96
     watching
    0.95
     hearing
    0.84
     spicy
    0.81
     dearly
    0.80
    ably
    0.79
     surprises
    0.77
     listening
    0.76
    lihood
    0.75
     tink
    0.73
    Act Density 0.933%

    No Known Activations