INDEX
Explanations
phrases related to expressions of opinion or disagreement
instances of the word "the."
Negative Logits
Token     Logit
fried     -0.66
#$        -0.65
ÙĴ        -0.65
esar      -0.65
itia      -0.64
hur       -0.63
aloud     -0.63
Cho       -0.63
.-        -0.62
ãĥ³       -0.62
Positive Logits
Token         Logit
sake          1.65
purposes      1.35
foreseeable   1.31
moment        1.09
longest       1.08
unin          1.07
past          1.05
meantime      1.05
duration      1.04
avoidance     1.00
Activations Density: 0.051%