INDEX
Explanations
references to the term "world" and its descriptors
New Auto-Interp
Negative Logits
illa
-0.18
ischen
-0.17
chen
-0.16
elor
-0.16
hoch
-0.15
ager
-0.15
.yy
-0.14
ardin
-0.14
orie
-0.14
variants
-0.14
POSITIVE LOGITS
wide
0.30
-wide
0.29
Wide
0.29
wide
0.28
Wide
0.28
wid
0.23
-ren
0.22
-class
0.19
premiere
0.19
traveler
0.18
Activations Density 0.039%