INDEX
Explanations
phrases related to detachment or disconnection
New Auto-Interp
Negative Logits
gio
-0.58
GB
-0.56
eta
-0.55
xual
-0.55
intimid
-0.54
iola
-0.54
graz
-0.54
Ô
-0.54
abund
-0.52
esan
-0.52
POSITIVE LOGITS
disconnect
0.72
yip
0.68
owship
0.66
omorph
0.64
ribut
0.64
alin
0.63
aline
0.61
detachment
0.60
Mub
0.59
Schr
0.59
Activations Density 5.452%