INDEX
Explanations
terms related to authenticity and credibility
terms and concepts related to authenticity and deception in media
New Auto-Interp
Negative Logits
Reef
-0.70
Weston
-0.66
Entered
-0.60
KS
-0.59
Kau
-0.58
Shepard
-0.57
Kra
-0.56
Fernandez
-0.55
Ninth
-0.55
Survivors
-0.55
POSITIVE LOGITS
usterity
1.14
theless
1.14
terday
1.03
\)
0.96
'?
0.96
lihood
0.94
»
0.94
estine
0.91
"
0.90
%"
0.90
Activations Density 0.338%