INDEX
Explanations
closing braces and their corresponding context in code snippets
New Auto-Interp
Negative Logits
and
-0.78
-
-0.61
,
-0.59
but
-0.58
in
-0.58
in
-0.57
s
-0.57
rest
-0.56
and
-0.56
one
-0.56
POSITIVE LOGITS
.)}
1.23
")));
1.20
"]}
1.20
})}
1.19
}}}
1.18
}}$}
1.17
}}}}
1.16
}}}
1.16
]}
1.16
")}
1.13
Activations Density 0.482%