INDEX
Explanations
patterns or sequences
repeated references to visual elements and attributes in descriptions
New Auto-Interp
Negative Logits
ŃĶ
-0.86
yss
-0.70
enum
-0.70
yrinth
-0.70
ressor
-0.68
bid
-0.67
Stronghold
-0.67
aternity
-0.66
ntil
-0.63
discipl
-0.62
POSITIVE LOGITS
mith
1.14
hips
1.13
ynthesis
1.10
heet
1.09
pots
1.08
peed
0.99
hops
0.97
pace
0.96
poons
0.96
paces
0.96
Activations Density 0.019%