INDEX
Explanations
elements related to laws and regulations
New Auto-Interp
Negative Logits
dikke
-0.17
Admir
-0.15
uum
-0.15
/thumb
-0.15
stad
-0.14
Yunan
-0.14
awan
-0.14
dét
-0.14
Kak
-0.14
u
-0.14
POSITIVE LOGITS
§§
0.20
§
0.19
lex
0.17
irit
0.17
andro
0.16
erm
0.16
jur
0.15
Ink
0.15
ียม
0.15
norm
0.15
Activations Density 0.057%