INDEX
Explanations
comparisons and analogies in descriptions
Negative Logits
Token     Logit
ADIO      -0.17
DAQ       -0.15
IGHL      -0.15
Interop   -0.15
gezocht   -0.15
cir       -0.15
åīĩ       -0.14
søker     -0.14
086       -0.14
.Atomic   -0.14
Positive Logits
Token     Logit
ving      0.17
315       0.16
azzi      0.15
933       0.15
reform    0.15
Reform    0.15
normal    0.14
Happ      0.14
Lord      0.14
conquer   0.14
Activations Density: 0.166%