INDEX
Explanations
the name "Pal" with a varying number at the end
occurrences of the name "Pal."
New Auto-Interp
Negative Logits
gee
-0.80
EEE
-0.76
afety
-0.75
UFF
-0.72
diffusion
-0.68
shut
-0.68
ssh
-0.68
ews
-0.67
ufact
-0.67
tick
-0.66
POSITIVE LOGITS
Pal
3.81
Pal
2.63
pal
2.17
pal
1.94
PAL
1.45
Pad
1.37
Palm
1.34
Palmer
1.26
Paladin
1.26
palette
1.25
Activations Density 0.005%