INDEX
Explanations
concepts related to progress and liberal ideology in historical contexts
New Auto-Interp
Negative Logits
ubat
-0.17
εÏģο
-0.15
avax
-0.15
Turing
-0.15
empo
-0.14
itler
-0.14
ecd
-0.14
aku
-0.14
PointerType
-0.14
rez
-0.14
POSITIVE LOGITS
Lock
0.32
Hob
0.30
Locke
0.29
Lock
0.28
Bent
0.28
Raw
0.24
LOCK
0.23
.Lock
0.23
Raw
0.22
Burke
0.22
Activations Density 0.047%