INDEX
Explanations
references to specific time periods or events
New Auto-Interp
Negative Logits
št
-0.17
Fried
-0.15
ãĤ³ãĥ¼ãĥī
-0.14
steps
-0.14
DEX
-0.14
ÏĦεÏħ
-0.14
Fowler
-0.13
Cum
-0.13
Steps
-0.13
Cum
-0.13
POSITIVE LOGITS
Misc
0.31
/misc
0.30
Misc
0.29
Uncategorized
0.28
misc
0.27
Miscellaneous
0.25
misc
0.24
miscellaneous
0.23
entertainment
0.22
Unc
0.21
Activations Density 0.170%