Explanations
words related to beliefs, presumptions, hypotheses, or propositions
phrases related to beliefs or hypotheses in various contexts
Negative Logits
Interstitial  -0.86
sung          -0.83
waters        -0.74
odes          -0.69
incinn        -0.67
HCR           -0.65
oho           -0.64
umen          -0.64
adan          -0.64
crew          -0.64
Positive Logits
assumption    1.04
assumptions   1.01
underpin      0.80
assumes       0.78
Lauder        0.75
staking       0.74
incorrectly   0.70
disclaimer    0.70
assum         0.68
premise       0.68
Activations Density 0.011%