INDEX
Explanations
information related to fundamental needs or concepts
Negative Logits
igham                 -0.84
isSpecialOrderable    -0.79
romeda                -0.74
Tycoon                -0.74
crow                  -0.69
uthor                 -0.69
Rodrigo               -0.68
oping                 -0.67
IDA                   -0.67
Allaah                -0.66
Positive Logits
necessities           1.18
tenets                1.11
basic                 0.98
arithmetic            0.97
lly                   0.94
principles            0.90
outline               0.89
tenance               0.86
premise               0.85
gradient              0.84
Activations Density: 12.541%