INDEX
Explanations
phrases related to law, governance, and regulatory compliance
New Auto-Interp
Negative Logits
rez
-0.18
åIJ¦
-0.16
conce
-0.14
reject
-0.14
çīĩ
-0.13
ordo
-0.13
ůž
-0.13
pres
-0.13
imity
-0.13
éĻIJ
-0.13
POSITIVE LOGITS
voluntary
0.18
indu
0.16
instead
0.16
nud
0.16
anela
0.16
rather
0.15
subt
0.15
æŀļ
0.15
gentle
0.15
nici
0.15
Activations Density 0.213%