INDEX
Explanations
verbs or phrases related to making decisions or conclusions
instances of the word "determine" in various contexts
New Auto-Interp
Negative Logits
Lens
-0.68
GB
-0.63
far
-0.62
exper
-0.61
door
-0.60
hearted
-0.59
nic
-0.59
science
-0.58
Zeal
-0.58
ita
-0.57
POSITIVE LOGITS
determine
1.00
cules
0.91
determines
0.90
ially
0.85
ministic
0.81
ively
0.78
uate
0.75
evaluates
0.74
initions
0.73
Ͻ
0.73
Activations Density 0.011%