INDEX
Explanations
references to religious or spiritual themes and events
New Auto-Interp
Negative Logits
aney
-0.17
claimer
-0.16
Ñģол
-0.15
ioneer
-0.15
leo
-0.15
rench
-0.15
bib
-0.14
ittle
-0.14
çĨ
-0.14
fleet
-0.14
POSITIVE LOGITS
Hel
0.18
ughs
0.17
Omni
0.17
ingo
0.15
elli
0.15
Luxury
0.15
hel
0.15
ifax
0.15
Vin
0.15
Humph
0.15
Activations Density 0.003%