INDEX
    Explanations

    occurrences of the preposition "in"

    New Auto-Interp
    Negative Logits
    .gateway
    -0.16
    arth
    -0.16
    onom
    -0.15
    adele
    -0.14
    nox
    -0.14
    adel
    -0.14
    iller
    -0.14
     Hlav
    -0.14
    artin
    -0.13
    adal
    -0.13
    POSITIVE LOGITS
    illance
    0.15
    )./
    0.15
    ero
    0.15
     Beard
    0.15
    оди
    0.15
     eskort
    0.15
    .ศ
    0.14
    igel
    0.14
    çν
    0.14
    ÑĢел
    0.13
    Act Density 0.130%

    No Known Activations