INDEX
Explanations
positive descriptions or qualities
adjectives describing beauty or attractiveness
Negative Logits
need         -0.64
Frankfurt    -0.64
matter       -0.62
tri          -0.57
Schr         -0.56
introduced   -0.55
pairs        -0.55
rein         -0.55
recon        -0.55
retail       -0.55
Positive Logits
iful         4.57
ifully       3.31
iless        1.93
eous         1.50
iable        1.25
FUL          1.22
ificent      1.11
uitous       1.11
ful          1.10
ious         1.04
Activations Density 0.015%