INDEX
Explanations
sentences with varying uses of "it" and "that"
New Auto-Interp
Negative Logits
atisfied
-0.17
neod
-0.15
aucoup
-0.14
ób
-0.14
storybook
-0.14
repr
-0.14
:');↵
-0.14
rnek
-0.14
orer
-0.14
Hats
-0.14
POSITIVE LOGITS
occurred
0.22
occurs
0.21
kind
0.19
occur
0.19
kind
0.19
appears
0.18
*
0.18
Occ
0.18
Freund
0.17
ocor
0.17
Activations Density 0.182%