INDEX
Explanations
instances of the word "determine" and its variations
Negative Logits
Token      Logit
ilities    -0.17
own        -0.17
nhau       -0.17
ses        -0.16
omer       -0.15
/bus       -0.15
ere        -0.15
iness      -0.15
teenth     -0.14
odia       -0.14
Positive Logits
Token      Logit
whether    0.20
ants       0.17
ally       0.16
fact       0.16
Whether    0.16
whether    0.16
extent     0.15
extent     0.15
Whether    0.15
lator      0.15
Activations Density 0.014%