INDEX
Explanations
phrases related to controversial or debated topics
the presence of the word “the” and its variations in context, indicating a focus on specifics or details in discussions
New Auto-Interp
Negative Logits
gon
-0.84
duino
-0.77
tackle
-0.77
cil
-0.72
eat
-0.71
cair
-0.69
ce
-0.68
aternity
-0.68
elaide
-0.68
Serv
-0.68
POSITIVE LOGITS
facts
1.35
ories
1.25
evidence
1.23
testimonies
1.20
conclusions
1.18
truth
1.18
Facts
1.13
findings
1.12
assertions
1.12
slightest
1.12
Activations Density 0.338%