INDEX
Explanations
the word "around" and its variants, indicating a focus on surrounding contexts or environments
New Auto-Interp
Negative Logits
<bos>
-0.44
незавершена
-0.41
Trus
-0.39
]})
-0.39
)})
-0.38
truk
-0.37
'))
-0.35
Dea
-0.35
timewa
-0.35
'})
-0.34
POSITIVE LOGITS
around
1.43
AROUND
1.37
Around
1.37
around
1.36
Around
1.31
AROUND
1.27
вокруг
1.03
autour
1.02
alrededor
0.95
omkring
0.95
Activations Density 0.077%