INDEX
Explanations
phrases indicating additional information included within parentheses
opening parentheses in the text
New Auto-Interp
Negative Logits
appropri
-0.76
obscurity
-0.76
inund
-0.75
veget
-0.75
antiv
-0.72
overfl
-0.71
undet
-0.71
microbiome
-0.71
diving
-0.70
nutrient
-0.70
POSITIVE LOGITS
â̦)
1.35
emphasis
1.28
laughs
1.14
...)
1.14
sic
1.08
hide
1.07
See
1.05
CBC
1.05
Laughs
1.03
Unless
0.99
Activations Density 0.062%