INDEX
Explanations
phrases related to unsupported or unproven claims
terms related to substantiation or lack thereof
New Auto-Interp
Negative Logits
NCT
-0.73
Ô
-0.69
Tycoon
-0.68
skirts
-0.66
ahime
-0.64
lain
-0.63
head
-0.60
Clubs
-0.60
Tate
-0.59
ARC
-0.58
POSITIVE LOGITS
iated
1.65
ive
1.34
iating
1.21
ively
1.19
iation
1.15
iable
1.15
ially
1.14
ivable
1.10
iate
1.10
iably
1.09
Activations Density 0.064%