INDEX
Explanations
references to planets or worlds in various contexts
New Auto-Interp
Negative Logits
outs
-0.18
erson
-0.16
erton
-0.16
edom
-0.15
poses
-0.15
erman
-0.15
sy
-0.14
aday
-0.14
ikel
-0.14
afi
-0.14
POSITIVE LOGITS
-wide
0.33
wide
0.27
esimal
0.26
arium
0.21
éĻħ
0.21
weit
0.20
wide
0.19
/star
0.18
/world
0.17
Wide
0.17
Activations Density 0.053%