Explanations
references to celestial beings and their attributes
Negative Logits
ċ
-0.21
ğ
-0.19
?↵↵↵
-0.18
ă
-0.17
â̦"↵↵
-0.16
]âĢı
-0.15
-č↵
-0.15
`{
-0.15
Ğ
-0.15
čč↵
-0.15
Positive Logits
,
0.98
,↵
0.77
.↵
0.75
.
0.73
,↵↵
0.68
.↵↵
0.66
:↵
0.63
,"
0.62
;
0.62
ØĮ
0.61
Activations Density 1.530%