INDEX
Explanations
the conjunction "and" in various contexts
New Auto-Interp
Negative Logits
acas
-0.82
osure
-0.76
ciples
-0.75
phthal
-0.72
ETS
-0.70
occup
-0.69
pec
-0.67
imilar
-0.67
osures
-0.66
iji
-0.66
POSITIVE LOGITS
incidentally
1.14
romeda
1.10
possibly
1.10
optionally
1.05
rightfully
1.04
-)
1.02
vice
1.01
hence
1.01
presumably
1.00
!)
1.00
Activations Density 0.038%