INDEX
Explanations
different versions or iterations of something
references to different versions or adaptations of stories or narratives
Negative Logits
Token     Logit
urers     -0.87
arers     -0.76
phasis    -0.75
ateurs    -0.74
ussion    -0.70
oji       -0.68
ering     -0.67
hya       -0.67
ennes     -0.67
erning    -0.66
Positive Logits
Token           Logit
Fortune         0.66
fortune         0.66
events          0.61
Xan             0.61
Christianity    0.60
Thing           0.59
CAT             0.59
Tomorrow        0.59
reality         0.59
Flip            0.59
Activations Density: 0.117%