INDEX
Explanations
questions or statements with the word "how"
queries or discussions about uncertainty and the manner of various situations or events
New Auto-Interp
Negative Logits
ibl
-0.67
ubs
-0.66
gin
-0.66
usa
-0.64
agin
-0.64
sic
-0.63
yon
-0.63
athering
-0.62
article
-0.62
inas
-0.61
POSITIVE LOGITS
much
1.33
far
1.16
many
1.13
long
1.13
badly
1.06
much
1.04
often
1.04
closely
1.03
quickly
1.03
prevalent
1.01
Activations Density 0.071%