INDEX
Explanations
the word "purple" and related terms
references to the term "Puritan" or its variations
the beginning syllable "Pur"
New Auto-Interp
Negative Logits
enegger
-0.88
worthiness
-0.82
ORGE
-0.75
schild
-0.71
removable
-0.67
holding
-0.67
external
-0.67
FAULT
-0.66
backed
-0.63
ORED
-0.63
POSITIVE LOGITS
POSE
0.87
kie
0.83
zn
0.82
poses
0.81
pose
0.80
agin
0.80
rette
0.80
ryn
0.79
neys
0.79
itans
0.78
Activations Density 0.011%