INDEX
Explanations
phrases that include the word "that" and its variations to identify clauses
New Auto-Interp
Negative Logits
acco
-0.17
acci
-0.16
.ro
-0.15
thal
-0.15
ocale
-0.15
jak
-0.15
еÑĢом
-0.15
hod
-0.15
lue
-0.15
tru
-0.15
POSITIVE LOGITS
anj
0.15
689
0.15
911
0.15
&page
0.14
олов
0.14
ombo
0.14
601
0.14
ify
0.14
subt
0.14
.INSTANCE
0.14
Activations Density 0.237%