INDEX
    Explanations

    the word "around" and its variants, indicating a focus on surrounding contexts or environments

    New Auto-Interp
    Negative Logits
    <bos>
    -0.44
     незавершена
    -0.41
    Trus
    -0.39
    ]})
    -0.39
    )})
    -0.38
     truk
    -0.37
    '))
    -0.35
    Dea
    -0.35
    timewa
    -0.35
    '})
    -0.34
    POSITIVE LOGITS
    around
    1.43
     AROUND
    1.37
    Around
    1.37
     around
    1.36
     Around
    1.31
    AROUND
    1.27
     вокруг
    1.03
     autour
    1.02
     alrededor
    0.95
     omkring
    0.95
    Act Density 0.077%

    No Known Activations