INDEX
Explanations
statements that emphasize or highlight specific facts
Negative Logits
Neutral    -0.06
ix         -0.06
179        -0.06
stral      -0.06
:↵         -0.06
elda       -0.06
XmlNode    -0.06
arian      -0.06
sh         -0.05
mall       -0.05
Positive Logits
fact       0.13
fakt       0.08
facts      0.08
FACT       0.08
Fact       0.08
fact       0.08
FACT       0.08
фак        0.07
Fact       0.07
_fact      0.07
Activations Density: 0.005%