INDEX
Explanations
specific patterns of phrases that include the word "that"
instances of the word "that" indicating various clauses or phrases in the text
New Auto-Interp
Negative Logits
Returns
-0.72
laughs
-0.67
thumbnails
-0.62
NES
-0.58
REDACTED
-0.57
Installation
-0.57
aug
-0.56
Is
-0.55
Mech
-0.54
Beh
-0.54
POSITIVE LOGITS
are
1.37
aren
1.35
were
1.30
weren
1.29
involve
1.21
comprise
1.18
resemble
1.17
circulate
1.16
relate
1.16
defy
1.15
Activations Density 0.199%