INDEX
Explanations
phrases related to conflict and rivalry
New Auto-Interp
Negative Logits
ekler
-0.16
idla
-0.16
entai
-0.15
orro
-0.15
оÑıÑĤелÑĮ
-0.15
oubted
-0.14
ÃĹ↵↵
-0.14
erli
-0.14
лиж
-0.13
è¿ŀæİ¥
-0.13
POSITIVE LOGITS
between
0.28
tensions
0.23
tension
0.23
between
0.23
heated
0.22
tug
0.21
Between
0.21
tranh
0.21
conflict
0.21
conflicts
0.20
Activations Density 0.338%