INDEX
Explanations
terms related to sexual orientations, specifically focusing on identifying words related to bisexual individuals
references to bisexuality and related concepts
New Auto-Interp
Negative Logits
Eat
-0.83
EMS
-0.76
LV
-0.71
keeper
-0.68
Yard
-0.67
urers
-0.65
steen
-0.64
witz
-0.64
bilt
-0.63
hunt
-0.62
POSITIVE LOGITS
ity
1.18
iliary
0.88
ities
0.84
dar
0.81
iated
0.80
citiz
0.80
bisexual
0.80
iliated
0.79
isexual
0.75
inals
0.74
Activations Density 0.026%