INDEX
Explanations
contrasting or alternative ideas presented with 'instead' or 'rather'
New Auto-Interp
Negative Logits
ä¸įæĺ¯
-0.25
ä¸įèĥ½
-0.24
tidak
-0.23
nicht
-0.23
ikke
-0.22
não
-0.22
not
-0.22
cannot
-0.22
không
-0.22
ä¸įä¼ļ
-0.21
POSITIVE LOGITS
merely
0.20
gaard
0.16
eti
0.15
opting
0.14
hen
0.14
ÙĨØ´
0.14
848
0.14
.cgi
0.14
pá
0.13
licht
0.13
Activations Density 0.059%