Explanations
conditional statements and contrasts
Negative Logits
ogan          -0.18
chen          -0.16
ys            -0.14
subjective    -0.13
hek           -0.13
इन            -0.13
imest         -0.13
ès            -0.13
ảng           -0.13
inks          -0.13
Positive Logits
only          0.28
ONLY          0.22
only          0.21
Only          0.20
.only         0.20
seulement     0.20
только        0.19
Only          0.18
_only         0.17
只            0.17
Activations Density 0.186%
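
The density figure above is not defined on this page. A common reading is the fraction of token positions on which the feature activates at all. The sketch below illustrates that reading only: the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature
    fires; the dashboard itself does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 19 of which activate the feature.
sample = [0.0] * 10_000
for i in range(19):
    sample[i * 500] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.186% would mean the feature fires on roughly 2 of every 1,000 tokens.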