INDEX
Explanations
words that feature rhymes or relate to rhyme schemes
Negative Logits
449        -0.19
normalize  -0.15
atas       -0.15
igans      -0.14
pers       -0.14
NavParams  -0.14
afone      -0.14
tyard      -0.14
Orient     -0.13
normalize  -0.13
POSITIVE LOGITS
Rh      0.30
Rh      0.26
ymes    0.26
rh      0.25
ynch    0.21
odes    0.21
annon   0.21
Rhodes  0.21
ubar    0.20
odium   0.19
Activations Density 0.010%