INDEX
Explanations
contrasting perspectives or narratives in a discussion
Negative Logits
IGO      -0.18
igo      -0.16
addock   -0.16
bé       -0.15
ÑĨик     -0.15
reform   -0.14
è¦       -0.14
uchs     -0.14
.tc      -0.14
-cols    -0.14
Positive Logits
tre       0.16
Kang      0.15
echan     0.15
nett      0.15
lob       0.14
åħĥ       0.14
}elseif   0.14
Jasper    0.14
struct    0.14
opat      0.14
Activations Density: 0.393%