INDEX
Explanations
references to nobility and royal lineage
New Auto-Interp
Negative Logits
pleaſure
-0.69
ſche
-0.66
ſind
-0.62
itſelf
-0.59
hyrchwyd
-0.59
lyre
-0.59
houſe
-0.57
neceff
-0.56
ſelves
-0.54
ſelf
-0.54
POSITIVE LOGITS
ContentAsync
0.42
varones
0.40
nocturno
0.37
zeugt
0.37
directos
0.35
oscuros
0.35
Turnier
0.34
intermédiaire
0.34
romántico
0.33
rêver
0.33
Activations Density 0.350%