INDEX
Explanations
words related to deception and disguise
instances of the syllable "gu" in varied contexts
New Auto-Interp
Negative Logits
cling
-0.76
croft
-0.75
Spectrum
-0.72
rings
-0.69
hower
-0.69
riad
-0.68
cycle
-0.67
HAEL
-0.65
ŃĶ
-0.64
âĶģ
-0.61
POSITIVE LOGITS
pta
1.16
ilty
1.14
idelines
1.13
errilla
1.09
arding
1.09
cci
1.08
vernment
1.05
arant
1.03
inea
1.01
ests
1.01
Activations Density 0.021%