INDEX
Explanations
phrases related to negative emotional states, particularly loneliness
concepts related to loneliness and solitude
Negative Logits
alez       -0.90
ropri      -0.83
ulate      -0.82
rity       -0.79
emark      -0.77
aminer     -0.77
ulin       -0.76
assador    -0.76
tarians    -0.73
ruction    -0.71
Positive Logits
confinement    0.89
oneliness      0.83
lonely         0.83
lust           0.76
Hots           0.74
loneliness     0.72
melancholy     0.70
melanch        0.69
bere           0.68
consolation    0.68
Activations Density 0.036%