INDEX
    Explanations

    occurrences of the word "in" and its variations within different contexts

    New Auto-Interp
    Negative Logits
    igen
    -0.16
    ivery
    -0.15
     Linden
    -0.14
     ÎŃκ
    -0.14
    ocl
    -0.14
    386
    -0.13
    olls
    -0.13
     Reconstruction
    -0.13
    CRET
    -0.13
    Icon
    -0.13
    POSITIVE LOGITS
     downtown
    0.16
    styl
    0.16
    isque
    0.15
    вед
    0.15
     presence
    0.15
    phem
    0.15
     Room
    0.14
     town
    0.14
    ulla
    0.14
    isté
    0.14
    Act Density 0.154%

    No Known Activations