Explanations
themes related to historical analysis and the evolution of ideas
Negative Logits
esel          -0.18
è§            -0.16
asso          -0.14
éĨ            -0.13
Hierarchy     -0.13
correct       -0.13
rompt         -0.13
Descriptors   -0.13
предус        -0.13
لیت           -0.12
Positive Logits
ways            0.24
persistence     0.23
intersections   0.22
rise            0.22
question        0.21
lived           0.21
contested       0.20
disc            0.20
leg             0.20
relationship    0.19
Activations Density: 0.155%