INDEX
Explanations
phrases that suggest exclusions or limitations
New Auto-Interp
Negative Logits
nil
-0.18
isoft
-0.16
ç«¥
-0.15
\grid
-0.14
nds
-0.14
NIL
-0.14
.metro
-0.14
вÑģп
-0.14
fter
-0.14
refs
-0.14
POSITIVE LOGITS
limited
0.26
limited
0.23
limitation
0.20
not
0.20
Limited
0.20
éĻIJ
0.18
Limited
0.17
LIMITED
0.17
limit
0.17
restricted
0.16
Activations Density 0.006%