Explanations
terms related to assumptions being made
references to assumptions
Negative Logits
Interstitial   -0.81
deed           -0.69
hern           -0.68
HCR            -0.66
amen           -0.65
sung           -0.65
iferation      -0.64
paying         -0.64
ching          -0.64
aska           -0.63
Positive Logits
assumptions    1.32
assumption     1.16
assum          0.91
assumes        0.81
princ          0.81
biases         0.79
underpin       0.78
assume         0.76
guesses        0.74
mistakes       0.74
Activations Density 0.011%