Explanations
phrases related to physical actions or desires
themes related to sexuality and identity
Negative Logits
Token           Logit
xtap            -0.51
çīĪ             -0.47
ãĥ´             -0.47
ofi             -0.44
anwhile         -0.42
accum           -0.42
cumulative      -0.41
referen         -0.41
declarations    -0.40
Niet            -0.40
Positive Logits
Token           Logit
anymore         0.66
ASAP            0.65
someday         0.64
sooner          0.62
?ãĢį            0.59
?",             0.59
?".             0.58
Belfast         0.58
?               0.58
?'              0.56
Activations Density 1.440%
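
Several tokens in the logit lists above (for example çīĪ, ãĥ´ and ?ãĢį) look like raw vocabulary strings from a byte-level BPE tokenizer rather than readable text. The sketch below assumes the model uses the GPT-2 style byte-to-unicode mapping, which this page does not state; the function names are hypothetical and chosen for illustration only. Under that assumption, each character stands for one byte, and the bytes can be decoded back to UTF-8.

```python
def gpt2_byte_decoder():
    """Build the inverse of the GPT-2 byte-to-unicode table (assumed tokenizer)."""
    # Bytes that are printable keep their own code point in the vocabulary.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    # Every remaining byte is shifted to code points starting at 256.
    offset = 0
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {chr(c): b for b, c in zip(byte_values, code_points)}


def decode_token(token):
    """Map a vocabulary string back to the text it encodes (hypothetical helper)."""
    table = gpt2_byte_decoder()
    raw = bytes(table[ch] for ch in token)
    # Tokens can end mid-character, so undecodable bytes are replaced, not raised.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["çīĪ", "ãĥ´", "?ãĢį", "Belfast"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, çīĪ decodes to 版, ãĥ´ to ヴ, and ?ãĢį to ?」, which would make the ?ãĢį entry consistent with the other question-mark-plus-closing-quote tokens in the positive list. Plain ASCII tokens such as Belfast pass through unchanged.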