INDEX
Explanations
concepts related to value and its various forms in different contexts
New Auto-Interp
Negative Logits
ÄįÃŃ
-0.16
laz
-0.15
æ°ı
-0.15
oge
-0.15
lando
-0.15
sher
-0.14
Ã¥n
-0.14
icast
-0.14
ote
-0.14
nas
-0.14
POSITIVE LOGITS
proposition
0.41
propositions
0.36
-added
0.32
added
0.31
added
0.31
Proposition
0.30
Added
0.30
Added
0.29
-add
0.29
adding
0.25
Activations Density 0.025%