Explanations
references to interpersonal relationships and interactions between characters
Negative Logits
gleichen    -0.37
pośred      -0.36
naselje     -0.36
takiej      -0.34
今回は      -0.33
Portale     -0.33
takiego     -0.32
,           -0.31
此事        -0.31
similar     -0.30
Positive Logits
Majefty       0.77
randomUUID    0.74
ſelf          0.72
houſe         0.72
pleaſure      0.69
typelib       0.61
weakSelf      0.60
deſſen        0.60
חיצוניים      0.59
盗撮          0.59
Activations Density: 0.536%