INDEX
Explanations
specific historical or time-related terms
references to early events or historical contexts
New Auto-Interp
Negative Logits
md
-0.85
pherd
-0.79
agree
-0.74
Pool
-0.72
lua
-0.71
unal
-0.69
bnb
-0.68
arse
-0.67
Msg
-0.66
ractor
-0.66
POSITIVE LOGITS
iterations
1.14
stages
1.13
adop
1.10
incarnation
1.06
drafts
1.04
phases
1.04
beginnings
1.03
incarn
1.02
generations
1.01
versions
1.01
Activations Density 0.083%