INDEX
Explanations
references to the word "Del" followed by a numerical value
the word "Del" in various contexts
New Auto-Interp
Negative Logits
Moonlight
-0.70
SHIP
-0.69
AFL
-0.66
OLOGY
-0.64
beans
-0.61
FTA
-0.60
CPI
-0.58
OPS
-0.58
Jinn
-0.58
finder
-0.57
POSITIVE LOGITS
ayed
1.30
phi
1.26
usional
1.20
icate
1.19
aware
1.16
ivery
1.15
ivered
1.14
ivering
1.14
iber
1.09
inqu
1.09
Activations Density 0.020%