INDEX
Explanations
references to authority and governance
New Auto-Interp
Negative Logits
iku
-0.16
ugi
-0.15
Kra
-0.14
exercise
-0.14
ik
-0.14
Lam
-0.14
Dao
-0.14
amac
-0.14
es
-0.14
nik
-0.14
POSITIVE LOGITS
PE
0.25
pe
0.25
Pe
0.23
Pe
0.20
_pe
0.20
peÄį
0.19
-pe
0.19
.pe
0.19
pe
0.18
ipe
0.17
Activations Density 0.036%