INDEX
Explanations
phrases related to restrictions and conditions in social or legal contexts
New Auto-Interp
Negative Logits
actice
-0.15
rish
-0.15
kses
-0.14
reme
-0.14
vys
-0.14
Ïĩει
-0.14
##_
-0.13
anny
-0.13
.createClass
-0.13
rus
-0.13
POSITIVE LOGITS
uw
0.14
ibal
0.13
cÃŃ
0.13
Door
0.13
ÄŁan
0.13
nez
0.13
stor
0.13
door
0.13
ATUS
0.13
tah
0.13
Activations Density 0.190%