INDEX
Explanations
instances of doubt or uncertainty in statements
expressions of reasons and justifications
Negative Logits
Token         Logit
occupations   -0.87
arest         -0.84
ancial        -0.77
ageing        -0.77
quartered     -0.76
negie         -0.75
laboratories  -0.75
dwellings     -0.75
entary        -0.73
emporary      -0.72
Positive Logits
Token         Logit
he            1.36
Rollins       1.00
Harbaugh      0.98
He            0.98
she           0.96
pissed        0.95
his           0.95
him           0.94
He            0.94
Saban         0.93
Activations Density: 0.955%