INDEX
Explanations
instances of the word "turn" and its variations, indicating a focus on actions involving taking turns or alternating
New Auto-Interp
Negative Logits
jde
-0.16
iram
-0.15
coles
-0.14
å
-0.14
wire
-0.14
Dag
-0.14
koli
-0.14
ulur
-0.14
jer
-0.14
dane
-0.13
POSITIVE LOGITS
zu
0.15
ört
0.14
еÑģÑĤв
0.14
ruz
0.14
rob
0.14
QL
0.14
št
0.14
ces
0.14
ussen
0.14
ovol
0.14
Activations Density 0.117%