INDEX
Explanations
phrases related to names
terms related to queer identity and culture
Negative Logits
ngth            -0.74
76561           -0.66
profession      -0.66
âĢ¢âĢ¢          -0.65
ripe            -0.62
slee            -0.57
resemb          -0.56
shapeshifter    -0.56
job             -0.56
iceberg         -0.56
Positive Logits
itely           0.99
rences          0.88
bush            0.81
ilogy           0.78
ocally          0.77
ilver           0.74
mire            0.74
ĵĺ              0.73
lain            0.71
coat            0.71
Activations Density 0.076%