INDEX
Explanations
names containing the word "Gay"
mentions of the word "Gay" in various contexts
Negative Logits
arily     -0.86
aries     -0.76
umbing    -0.73
yrim      -0.73
igslist   -0.71
arians    -0.70
owl       -0.69
uncture   -0.67
icum      -0.65
ariat     -0.65
Positive Logits
dos       1.07
nor       1.07
lord      1.03
lyn       0.93
bian      0.91
cation    0.86
Spectrum  0.85
bent      0.84
dar       0.83
zilla     0.83
Activations Density 0.024%