INDEX
Explanations
conjunctions and coordinating phrases in complex sentences
New Auto-Interp
Negative Logits
etri
-0.17
crime
-0.15
ollen
-0.15
isoft
-0.15
oke
-0.15
hoa
-0.15
zel
-0.14
Crime
-0.14
yz
-0.14
icz
-0.14
POSITIVE LOGITS
Next
0.16
Copp
0.15
Tou
0.14
baum
0.14
ultimately
0.14
cip
0.14
adult
0.14
glm
0.13
without
0.13
--
0.13
Activations Density 0.079%