INDEX
Explanations
assertions and discussions surrounding truth, facts, and beliefs related to specific issues and controversies
Negative Logits
ưá»Ŀng      -0.15
tpl        -0.14
ανδ        -0.14
nio        -0.13
ίκ         -0.13
åł´        -0.13
å¿Ĺ        -0.13
_NV        -0.13
tel        -0.13
Overrides  -0.13
Positive Logits
facts      0.87
fact       0.79
Facts      0.72
facts      0.71
Fact       0.68
FACT       0.68
fact       0.66
Fact       0.65
_fact      0.59
FACT       0.59
Activations Density 0.218%