INDEX
Explanations
words related to pseudonyms or pseudoscience
terms related to pseudoscience and misleading claims
New Auto-Interp
Negative Logits
sights
-0.84
cakes
-0.84
cake
-0.84
sight
-0.61
ilion
-0.60
hearts
-0.60
servicing
-0.59
stakes
-0.59
brance
-0.58
hearted
-0.57
POSITIVE LOGITS
udo
1.08
udos
0.94
arial
0.89
atural
0.82
Pse
0.81
onymous
0.76
bull
0.75
onyms
0.75
crow
0.74
entials
0.72
Activations Density 0.028%