INDEX
Explanations
mentions of hypothetical scenarios/actions not taken
references to political figures and their actions or impacts
Negative Logits
rawdownloadcloneembedreportprint  -0.74
ilial                             -0.66
*/(                               -0.65
externalToEVAOnly                 -0.63
progresses                        -0.58
atten                             -0.57
FY                                -0.56
extends                           -0.56
underside                         -0.56
sup                               -0.55
Positive Logits
would     1.27
wouldn    1.23
would     1.13
Would     1.12
'd        1.04
Wouldn    1.00
Would     0.99
probably  0.94
probably  0.90
surely    0.89
Activations Density 0.276%