Explanations
proper nouns followed by what seems to be a unique identifier
the letter 'P' in varying contexts
Negative Logits
Token       Logit
hitch       -0.62
ties        -0.61
uality      -0.60
galaxies    -0.59
dressing    -0.58
straw       -0.56
editions    -0.54
laps        -0.54
unde        -0.54
seams       -0.53
Positive Logits
Token       Logit
P           3.30
Ps          1.95
P           1.72
PF          1.71
PB          1.70
p           1.70
PP          1.62
PK          1.60
PT          1.47
PN          1.45
Activations Density: 0.025%