INDEX
Explanations
words related to deception or dishonesty
terms related to deception and dishonesty
Negative Logits
Token       Logit
capacity    -0.83
foreseen    -0.78
Mobility    -0.74
area        -0.74
radius      -0.69
rait        -0.69
trak        -0.69
served      -0.69
Radius      -0.68
alg         -0.67
Positive Logits
Token          Logit
misrepresent   1.18
omission       1.15
deceive        1.12
deception      1.07
falsely        1.07
hoax           1.04
obfusc         1.04
deceived       1.03
deceit         1.02
liar           1.01
Activations Density: 0.135%
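
The density figure is not defined on this page. A common reading, assumed here, is the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy activation values are all hypothetical and do not come from this feature's data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is
    active, not an average of activation magnitudes.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 2000 token positions, 3 of which are active.
toy_activations = [0.0] * 1997 + [0.4, 1.2, 2.7]

density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints "0.150%"
```

Under this reading, a density of 0.135% would mean the feature is active on roughly 1 in 740 sampled tokens, which is consistent with a feature that targets a narrow vocabulary such as deception-related words.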