INDEX
    Explanations

    instances of the word "in" and related phrase structures

    New Auto-Interp
    Negative Logits
    auc
    -0.17
    andles
    -0.16
    emain
    -0.15
    lescope
    -0.15
     Hearth
    -0.15
    arcer
    -0.15
    eeper
    -0.14
    iple
    -0.14
    inges
    -0.14
    uÃŃ
    -0.14
    POSITIVE LOGITS
     bed
    0.16
     sharedInstance
    0.15
     traffic
    0.15
     Corner
    0.14
     corner
    0.14
     chairs
    0.14
    _bw
    0.14
     Jacques
    0.14
     chair
    0.14
    aya
    0.14
    Act Density 0.174%

    No Known Activations