INDEX
Explanations
references to the Quran and Islamic teachings
New Auto-Interp
Negative Logits
aint
-0.16
reg
-0.15
cred
-0.15
Phoenix
-0.14
elden
-0.14
Phoenix
-0.14
Pow
-0.14
rd
-0.14
vs
-0.14
æ²Ļ
-0.14
POSITIVE LOGITS
andle
0.17
asher
0.16
ãģŁãĤĬ
0.16
ÄĽnÃŃ
0.15
inceton
0.15
á»ĩu
0.14
ÂŃi
0.14
âĻª↵↵
0.14
frei
0.14
/Instruction
0.14
Activations Density 0.020%