INDEX
Explanations
words related to swirling or spinning
instances of the term "girl" in various contexts
New Auto-Interp
Negative Logits
ħĭ
-0.69
QUIRE
-0.62
smart
-0.61
mature
-0.60
marrow
-0.60
punishing
-0.59
personalized
-0.59
imposing
-0.59
Forbidden
-0.59
demanding
-0.58
POSITIVE LOGITS
irl
1.04
itudinal
0.96
onge
0.89
iot
0.89
iated
0.88
itude
0.86
iffe
0.85
ipedia
0.85
ados
0.84
oin
0.84
Activations Density 0.008%