INDEX
Explanations
occurrences of the word "some" in various contexts
New Auto-Interp
Negative Logits
irie
-0.15
ped
-0.15
umer
-0.15
tempt
-0.14
uet
-0.14
hec
-0.14
ned
-0.14
ÑĤÑĢи
-0.14
çļĦä¸Ģ个
-0.13
ed
-0.13
POSITIVE LOGITS
/all
0.25
place
0.24
许
0.19
룬
0.18
kind
0.18
-times
0.18
ones
0.17
akin
0.17
æł·çļĦ
0.17
hw
0.17
Activations Density 0.097%