INDEX
Explanations
highlights or key points in a text
references to activities, events, or actions that indicate engagement or participation
Negative Logits
ãĥ¼ãĥ       -0.73
apest       -0.73
ãĥĻ         -0.70
ioxide      -0.69
utable      -0.68
ãĥĩãĤ£      -0.68
оÐ          -0.68
ãĥĥãĥī      -0.67
Ľ           -0.66
abling      -0.66
Positive Logits
during      1.24
for         1.20
with        1.15
without     1.00
against     0.98
throughout  0.98
in          0.98
alongside   0.98
before      0.97
after       0.95
Activations Density 0.736%