INDEX
    Explanations

    the preposition "in" across various contexts

    New Auto-Interp
    Negative Logits
    acz
    -0.15
    oux
    -0.15
    oved
    -0.15
     cass
    -0.15
    ollen
    -0.15
    uzey
    -0.14
    ocz
    -0.14
    NOWLED
    -0.14
    chet
    -0.14
    formed
    -0.14
    POSITIVE LOGITS
     SolidColorBrush
    0.15
    atar
    0.15
    istani
    0.15
    Circular
    0.15
     Circular
    0.14
    tit
    0.14
    mast
    0.14
    ndata
    0.14
    Stick
    0.14
    IMIT
    0.14
    Act Density 0.015%

    No Known Activations