INDEX
Explanations
text segments discussing actions related to mental and physical health
Negative Logits
ſelves          -1.19
iſt             -1.17
AddTagHelper    -1.12
ſind            -1.05
المعيارى        -1.05
ſelf            -1.04
^(@)            -1.03
Meksiku         -1.02
itſelf          -1.02
Jefus           -1.02
Positive Logits
to              0.67
for             0.59
of              0.58
in              0.57
,               0.56
.               0.54
↵↵              0.53
                0.51
:               0.51
nos             0.50
Activations Density 0.516%