INDEX
Explanations
words related to visual stimuli and actions
dynamic visual elements and actions in descriptions
Negative Logits
chery       -0.90
icians      -0.82
saf         -0.82
hers        -0.80
itism       -0.79
paying      -0.79
TRY         -0.78
thodox      -0.77
ches        -0.76
aligned     -0.76
Positive Logits
Archdemon   0.85
sands       0.84
sickness    0.80
tide        0.79
masses      0.73
tides       0.72
sun         0.72
fortunes    0.71
hearts      0.70
NESS        0.70
Activations Density 0.148%
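
For orientation, the density figure can be read as a simple ratio. The sketch below assumes that "activation density" means the percentage of sampled tokens on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: a token counts as "active" when its activation
    exceeds the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical activations for 8 tokens; the feature fires on 2 of them.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.7, 0.0]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.148% would correspond to the feature firing on roughly 1 to 2 tokens per 1,000 sampled.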