INDEX
Explanations
statements that include conjunctions and related phrases
Follows the word "and"
lists joined by and
New Auto-Interp
Negative Logits
They
-0.69
they
-0.68
I
-0.67
K
-0.62
A
-0.61
W
-0.61
it
-0.59
E
-0.59
1
-0.59
D
-0.58
POSITIVE LOGITS
other
1.02
pecially
0.92
]<<"
0.92
]='\
0.92
?>/
0.92
)";
0.91
ratulations
0.91
ignty
0.90
"):
0.90
.}(
0.90
Activations Density 0.720%