Explanations
passages that discuss relationships, affections, and emotional connections between characters
Negative Logits
Friedman    -0.18
ubi         -0.16
-ли         -0.15
chematic    -0.15
utz         -0.15
ikut        -0.14
ockey       -0.14
823         -0.14
ouver       -0.14
679         -0.14
Positive Logits
alike       0.17
upon        0.16
ken         0.15
ar          0.14
vr          0.14
.Pending    0.14
.liferay    0.14
avid        0.14
pup         0.14
782         0.14
Activations Density: 0.499%