INDEX
Explanations
actions related to flipping or rolling over
New Auto-Interp
Negative Logits
новниш
-0.47
poveznice
-0.46
Đi
-0.45
Popis
-0.45
どうしても
-0.45
plads
-0.44
oublié
-0.44
maaf
-0.44
்கு
-0.43
Po
-0.43
POSITIVE LOGITS
flipped
1.41
flips
1.41
flipping
1.39
flip
1.36
flip
1.26
Flip
1.24
Flip
1.19
overturn
1.15
turning
1.14
Turning
1.13
Activations Density 0.217%