INDEX
Explanations
phrases related to relationships and accountability
New Auto-Interp
Negative Logits
cium
-0.67
Contin
-0.67
AX
-0.66
oids
-0.59
enium
-0.59
otropic
-0.59
ombat
-0.58
ibu
-0.57
utt
-0.57
iband
-0.56
POSITIVE LOGITS
theirs
1.86
hers
1.67
yours
1.51
ours
1.49
mine
1.31
his
1.25
their
1.22
your
1.18
his
1.17
your
1.12
Activations Density 3.047%