INDEX
Explanations
high-level trends or predictions about technological advancements
New Auto-Interp
Negative Logits
four
-0.17
sixth
-0.16
six
-0.16
fourth
-0.16
eight
-0.16
fifth
-0.16
nine
-0.15
two
-0.15
third
-0.15
unny
-0.15
POSITIVE LOGITS
III
0.29
II
0.26
101
0.24
IV
0.21
âħ
0.18
IIIK
0.18
vs
0.17
âħ¥
0.17
III
0.17
2
0.17
Activations Density 0.082%