INDEX
Explanations
the word "that" in sentences, possibly indicating a focus on specific contexts or conditions
the word "that" to identify clauses or statements
New Auto-Interp
Negative Logits
Guard
-0.67
oses
-0.66
lean
-0.65
gur
-0.64
respect
-0.63
aq
-0.63
le
-0.60
ounding
-0.59
´
-0.58
van
-0.58
POSITIVE LOGITS
they
0.92
soever
0.89
THEY
0.84
there
0.83
we
0.81
unlike
0.79
*/(
0.78
although
0.78
nobody
0.78
it
0.70
Activations Density 0.116%