INDEX
Explanations
statements about existence or conditions of objects or concepts, often focusing on their characteristics or statuses
Negative Logits
ä¸ĺ      -0.16
eus      -0.15
esine    -0.14
ahlen    -0.14
_SD      -0.14
è¾ħ      -0.13
quant    -0.13
aż       -0.13
eu       -0.13
VD       -0.13
Positive Logits
fine           0.23
fine           0.19
documented     0.18
Fine           0.17
kind           0.17
covered        0.17
oko            0.16
discouraged    0.16
handled        0.16
legacy         0.16
Activations Density 0.251%