INDEX
Explanations
unique identifiers or special symbols within a text
New Auto-Interp
Negative Logits
subur
-0.14
Erotische
-0.13
seksi
-0.12
.Style
-0.11
erotik
-0.11
erotica
-0.11
alternatives
-0.11
elimin
-0.11
Delete
-0.11
explos
-0.11
POSITIVE LOGITS
honour
0.29
commem
0.28
commemor
0.27
honoured
0.27
honor
0.26
honoring
0.25
memorial
0.25
tribute
0.24
honored
0.23
Honour
0.23
Activations Density 0.016%