INDEX
Explanations
references to influential figures and their teachings
New Auto-Interp
Negative Logits
彦
-0.15
leur
-0.15
printStats
-0.14
ovit
-0.14
айÑĤ
-0.14
ë§Īëĭ¤
-0.14
loy
-0.13
ordo
-0.13
obo
-0.13
ÙĦØ©
-0.13
POSITIVE LOGITS
himself
0.18
kins
0.15
ETIME
0.15
McB
0.15
Himself
0.15
steen
0.15
amon
0.14
رخ
0.14
Bars
0.14
bast
0.14
Activations Density 0.112%