INDEX
Explanations
phrases related to fabrication or falsification
terms related to falsification and misrepresentation
Negative Logits
Empires     -0.68
Kingdoms    -0.67
Alz         -0.67
Apostles    -0.66
Surv        -0.66
Brilliant   -0.65
SPA         -0.64
Revenue     -0.64
gamer       -0.63
IRO         -0.62
Positive Logits
ifiers      1.19
ified       1.18
ifying      1.14
eness       1.11
ifier       1.10
ifiable     1.09
esty        1.05
fals        1.05
itives      1.04
ifies       1.04
Activations Density: 0.011%