INDEX
Explanations
terms related to low values or rankings
references to low quantities or levels
New Auto-Interp
Negative Logits
mirrors
-0.67
dinosaur
-0.61
chairs
-0.61
attest
-0.60
pieces
-0.59
seeds
-0.58
words
-0.58
ver
-0.57
giant
-0.57
arms
-0.56
POSITIVE LOGITS
low
4.30
Low
2.10
high
1.79
LOW
1.68
lower
1.64
Low
1.59
highest
1.37
lows
1.33
worst
1.33
low
1.29
Activations Density 0.010%