INDEX
Explanations
phrases containing the word "that"
the repeated word "that" and its variations in context
New Auto-Interp
Negative Logits
heavily
-0.66
cap
-0.62
level
-0.61
priority
-0.60
bow
-0.59
rate
-0.58
preference
-0.58
status
-0.57
cost
-0.57
st
-0.57
POSITIVE LOGITS
that
2.97
those
1.80
which
1.69
they
1.59
these
1.56
there
1.49
such
1.49
whose
1.49
whether
1.48
this
1.46
Activations Density 0.018%