INDEX
Explanations
phrases involving the word "that" followed by a descriptor or action
phrases containing the word "that" and its relationships to various concepts or situations
New Auto-Interp
Negative Logits
citing
-0.64
foregoing
-0.64
ached
-0.63
noting
-0.63
concluding
-0.62
issuing
-0.61
fielding
-0.59
boarding
-0.59
ilot
-0.59
assis
-0.59
POSITIVE LOGITS
mattered
1.12
hurts
1.10
shouldn
1.07
nobody
1.07
ought
1.03
belongs
1.02
annoy
0.97
scares
0.97
horr
0.96
inspires
0.93
Activations Density 0.205%