INDEX
    Explanations

    instances of the word "in" followed by another word

    instances of the phrase "sitting in."

    New Auto-Interp
    Negative Logits
    ŀ
    -0.68
     endors
    -0.64
    aler
    -0.64
    elf
    -0.62
    linger
    -0.59
    NOW
    -0.58
    llor
    -0.58
    lihood
    -0.58
    killed
    -0.54
     purch
    -0.54
    POSITIVE LOGITS
    ordinate
    1.15
     accordance
    1.02
    offensive
    1.01
    animate
    0.99
     situ
    0.99
     lieu
    0.98
     front
    0.97
     limbo
    0.95
    between
    0.95
    roads
    0.94
    Act Density 0.225%

    No Known Activations