Explanations
phrases related to dropping or reducing limitations and complexity
Negative Logits
چار        -0.19
hữu        -0.17
oso        -0.15
basePath   -0.15
agrid      -0.14
kaum       -0.14
.bold      -0.14
linger     -0.14
OPS        -0.13
yonel      -0.13
Positive Logits
barriers       0.18
форми          0.17
boundaries     0.17
traditional    0.16
resistance     0.16
gap            0.15
Borders        0.15
inhib          0.15
existing       0.15
restrictions   0.15
Activations Density: 0.254%