INDEX
Explanations
negations and their implications within a broader context
New Auto-Interp
Negative Logits
successors
-0.69
Cosponsors
-0.68
artney
-0.68
OULD
-0.67
vernment
-0.66
ebted
-0.65
ipel
-0.65
enda
-0.62
subsequent
-0.62
ortment
-0.61
POSITIVE LOGITS
understatement
0.84
scarce
0.81
bitch
0.74
finite
0.71
noun
0.70
commodity
0.69
---------
0.69
causation
0.69
healer
0.68
lubric
0.68
Activations Density 0.235%