INDEX
Explanations
instances where something is assumed or taken as a given
instances of the word "assumed" in various contexts
New Auto-Interp
Negative Logits
Money
-0.75
Sche
-0.71
licks
-0.68
deed
-0.68
alez
-0.66
Blog
-0.64
enta
-0.62
andro
-0.62
Tips
-0.62
deeds
-0.61
POSITIVE LOGITS
assumed
1.10
assume
1.04
assumes
0.86
assuming
0.78
"$:/
0.75
incorrectly
0.73
assum
0.73
llor
0.70
IELD
0.69
assumptions
0.65
Activations Density 0.008%