INDEX
Explanations
instances of uncertainty or lack of information about identities
references to the word "who" indicating uncertainty about identity or authorship
New Auto-Interp
Negative Logits
framework
-0.80
MER
-0.70
retion
-0.69
emin
-0.68
strip
-0.67
warning
-0.64
Globe
-0.64
emp
-0.62
esm
-0.62
urg
-0.61
POSITIVE LOGITS
else
1.08
soever
1.04
owns
0.94
exactly
0.93
cares
0.91
cared
0.87
abouts
0.86
owes
0.85
dinand
0.83
participates
0.82
Activations Density 0.042%