Explanations
references to an organization or group named Affirmation
Negative Logits
Token             Logit
²¾                -0.85
OPLE              -0.81
GY                -0.77
çķ                -0.72
senal             -0.72
DragonMagazine    -0.68
çīĪ               -0.65
Muller            -0.64
swick             -0.63
Mata              -0.63
Positive Logits
Token             Logit
idav              1.43
inity             1.27
irmation          1.20
irm               1.18
licted            1.18
irmed             1.18
leck              1.14
iliate            1.13
luent             1.12
luence            1.08
Activations Density: 0.008%