INDEX
Explanations
phrases related to strong emotions and personal connections
expressions related to falling in love
New Auto-Interp
Negative Logits
ctive
-0.74
iliary
-0.72
shaw
-0.65
herty
-0.63
nea
-0.63
sylv
-0.63
200000
-0.62
wcs
-0.62
future
-0.62
CLA
-0.62
POSITIVE LOGITS
deaf
0.75
asleep
0.73
emetery
0.72
ĺħ
0.71
Haku
0.71
Pieces
0.69
Pigs
0.68
alach
0.66
osate
0.65
trap
0.64
Activations Density 0.106%