INDEX
Explanations
references to historical religious events and figures
New Auto-Interp
Negative Logits
ROL
-0.17
edm
-0.15
ibur
-0.15
(çģ«
-0.15
æ´¥
-0.15
dap
-0.15
iane
-0.14
bab
-0.14
Bias
-0.14
ÑĢави
-0.14
POSITIVE LOGITS
brief
0.16
reign
0.16
Reign
0.15
according
0.15
Brief
0.15
succeeded
0.14
ledger
0.14
одеÑĢж
0.14
successor
0.14
BÄĽ
0.14
Activations Density 0.138%