INDEX
    Explanations

    references to "side" and "sides," indicating a focus on aspects or perspectives of a situation

    New Auto-Interp
    Negative Logits
    itude
    -2.01
     live
    -1.60
     Spacewatch
    -1.51
    ifax
    -1.48
    «
    -1.48
     sleepy
    -1.46
    onde
    -1.46
    ulance
    -1.43
    itness
    -1.42
    itte
    -1.42
    POSITIVE LOGITS
    kick
    2.37
    walks
    2.06
    wall
    1.89
    ographies
    1.87
    plates
    1.81
    plays
    1.78
    velt
    1.78
    plate
    1.72
    piece
    1.70
    walls
    1.69
    Act Density 0.131%

    No Known Activations