INDEX
Explanations
statements
This neuron responds to the core definition sentence that says a summary is factually consistent if “all statements in the summary are entailed by the document.”
New Auto-Interp
Negative Logits
وره
-0.08
okers
-0.07
ousedown
-0.06
Creators
-0.06
Danger
-0.06
角
-0.06
ùy
-0.06
اءات
-0.06
讯
-0.06
Matches
-0.06
POSITIVE LOGITS
.Txt
0.07
contempt
0.07
depicting
0.06
="-
0.06
vat
0.06
pledged
0.06
komen
0.06
altern
0.06
_LOWER
0.06
.api
0.06
Activations Density 0.001%