INDEX
    Explanations

    instances of the word "in" used within various contexts throughout the text

    New Auto-Interp
    Negative Logits
    rouw
    -0.16
    agger
    -0.15
    aptor
    -0.14
    ĭ
    -0.14
    orest
    -0.14
    еÑī
    -0.14
    ither
    -0.14
    kop
    -0.14
    ickers
    -0.14
    ulin
    -0.14
    POSITIVE LOGITS
     ways
    0.20
    zik
    0.16
    ushima
    0.16
    ways
    0.15
    /light
    0.14
    Ħĸ
    0.14
     Ways
    0.14
    湯
    0.14
    anine
    0.14
     Scar
    0.14
    Act Density 0.255%

    No Known Activations