INDEX
Explanations
quotations within the text with potential emotional or controversial impact
New Auto-Interp
Negative Logits
affiliate
-0.79
Ross
-0.77
rental
-0.75
outfielder
-0.73
matter
-0.73
rall
-0.72
grasp
-0.72
editor
-0.71
buoy
-0.71
McKay
-0.70
POSITIVE LOGITS
classic
1.45
false
1.44
true
1.44
pure
1.43
little
1.40
normal
1.40
every
1.38
almost
1.37
double
1.37
nothing
1.35
Activations Density 0.524%