INDEX
Explanations
references to biblical scripture and related legal terminology
Negative Logits
ãĥ¼ãĥ    -0.17
stras    -0.16
же    -0.16
hem    -0.16
cons    -0.15
rof    -0.14
олод    -0.14
اÙĪØª    -0.14
uos    -0.14
iska    -0.14
Positive Logits
oni    0.19
-chan    0.16
.drive    0.15
ạp    0.15
apter    0.15
iek    0.14
á»ĭch    0.14
kir    0.14
moc    0.14
NU    0.13
Activations Density 0.108%