INDEX
Explanations
conditional phrases that express dependency
phrases indicating dependency or condition
Negative Logits
vision    -0.80
nels      -0.79
mary      -0.68
atur      -0.67
lis       -0.67
cap       -0.65
vis       -0.65
nings     -0.64
jc        -0.64
Pall      -0.64
Positive Logits
depended      1.15
depends       0.98
challeng      0.97
¬¼            0.91
depend        0.90
adversely     0.84
ratulations   0.84
confir        0.84
ende          0.83
awaru         0.80
Activations Density 0.011%