INDEX
Explanations
instances of failure and unsuccessful attempts in various contexts
New Auto-Interp
Negative Logits
789
-0.16
okt
-0.16
ag
-0.15
Kash
-0.15
ign
-0.14
748
-0.14
bern
-0.14
uem
-0.14
NR
-0.14
sey
-0.14
POSITIVE LOGITS
Attempt
0.19
Attempt
0.18
attempts
0.17
attempt
0.17
OfType
0.16
attempt
0.16
forControlEvents
0.15
попÑĭÑĤ
0.15
UME
0.15
arro
0.15
Activations Density 0.189%