INDEX
Explanations
concepts related to community, values, and equity
New Auto-Interp
Negative Logits
^^
-0.15
amplify
-0.14
otton
-0.14
.squeeze
-0.14
_clock
-0.14
achen
-0.13
.jetbrains
-0.13
roker
-0.13
lif
-0.13
olve
-0.13
POSITIVE LOGITS
inform
0.60
informs
0.54
Inform
0.53
inform
0.52
informed
0.52
informing
0.52
Inform
0.50
shape
0.43
shapes
0.41
shaped
0.40
Activations Density 0.362%