INDEX
Explanations
phrases related to increasing or reducing something
instances of the word "the."
New Auto-Interp
Negative Logits
çīĪ
-0.82
ouse
-0.79
itia
-0.71
ESE
-0.68
Pal
-0.68
gat
-0.67
itars
-0.66
perse
-0.66
replace
-0.65
gnu
-0.64
POSITIVE LOGITS
amount
1.50
number
1.36
likelihood
1.35
effectiveness
1.23
size
1.23
chances
1.18
incidence
1.16
scope
1.13
odds
1.10
extent
1.07
Activations Density 0.161%