INDEX
Explanations
phrases related to actions people take in various processes or tasks
New Auto-Interp
Negative Logits
imb
-0.15
osc
-0.15
igit
-0.15
inet
-0.15
baum
-0.14
amba
-0.14
deg
-0.14
ãĥķãĤ
-0.14
osc
-0.14
ime
-0.14
POSITIVE LOGITS
à¹ĥà¸Ķ
0.21
ä»»ä½ķ
0.21
ANY
0.20
any
0.20
qualquer
0.19
ANY
0.18
cualquier
0.16
OTHERWISE
0.16
Ùħباش
0.15
jinak
0.15
Activations Density 0.053%