INDEX
Explanations
references to historical or spiritual practices
New Auto-Interp
Negative Logits
oku
-0.17
orelease
-0.15
âĸ²
-0.14
çī
-0.14
_sale
-0.14
plans
-0.13
scheduled
-0.13
opak
-0.13
isan
-0.13
spiracy
-0.13
POSITIVE LOGITS
spread
0.22
transmission
0.20
borrow
0.20
Transmission
0.20
copied
0.20
spread
0.20
influence
0.20
borrowing
0.19
borrow
0.19
transmitted
0.19
Activations Density 0.161%