INDEX
    Explanations

    instances of the word "in" to identify various contexts or situations

    New Auto-Interp
    Negative Logits
    enia
    -0.14
    gili
    -0.14
    _Err
    -0.14
    ikip
    -0.13
     Ãľst
    -0.13
     å¹
    -0.13
    tring
    -0.13
    eless
    -0.13
    GRAM
    -0.13
    misc
    -0.13
    POSITIVE LOGITS
     ways
    0.45
     away
    0.32
    ways
    0.30
     Ways
    0.30
     way
    0.29
     novel
    0.26
     manners
    0.26
    away
    0.25
     Away
    0.24
     anyway
    0.24
    Act Density 0.087%

    No Known Activations