INDEX
Explanations
elements of deception and infiltration in narratives
suggesting ignorance or unawareness
unaware of the truth
New Auto-Interp
Negative Logits
sério
-0.46
mogat
-0.46
AndEndTag
-0.45
XmlRootElement
-0.45
Organisateur
-0.44
Administrativna
-0.44
vejo
-0.43
creativos
-0.43
AndroidJUnit
-0.42
Codable
-0.42
POSITIVE LOGITS
oblivious
0.49
unaware
0.44
AccessFile
0.42
blind
0.42
mistakenly
0.42
unsuspecting
0.42
unsus
0.41
Dete
0.40
rub
0.40
innocently
0.39
Activations Density 0.465%