INDEX
Explanations
instances of the word "addition" or similar terms indicating an increase or inclusion
New Auto-Interp
Negative Logits
extra
-0.72
dafx
-0.55
ekstra
-0.55
extra
-0.53
supplemental
-0.47
esticides
-0.47
EXTRA
-0.44
Magdal
-0.43
Extra
-0.43
supplementary
-0.43
POSITIVE LOGITS
addition
2.36
Addition
2.27
addition
2.23
Addition
2.17
additions
1.88
Additions
1.77
Additions
1.50
Adding
1.27
Adding
1.25
inclusion
1.23
Activations Density 0.151%