INDEX
    Explanations

    locations or directions

    the preposition "in" in various contexts

    New Auto-Interp
    Negative Logits
    gey
    -0.71
    ynes
    -0.70
     therein
    -0.68
     Explain
    -0.67
    lain
    -0.65
     contributors
    -0.64
    UFC
    -0.62
     compares
    -0.62
     compared
    -0.61
    ect
    -0.60
    POSITIVE LOGITS
     search
    1.26
     handcuffs
    1.19
     dro
    1.12
     haste
    1.12
     pursuit
    1.07
     disgust
    1.03
     disguise
    0.99
     desperation
    0.92
     hopes
    0.91
     tears
    0.89
    Act Density 0.177%

    No Known Activations