Explanations
phrases that indicate complexity or contradictions in situations
Negative Logits
олос     -0.16
rx       -0.15
illo     -0.15
OTH      -0.15
Sands    -0.14
케       -0.14
unner    -0.14
acam     -0.14
o        -0.14
ケ       -0.14
Positive Logits
далеко         0.17
unfortunately  0.16
也有           0.16
sometimes      0.16
auga           0.16
Bal            0.15
яти            0.15
tempered       0.15
æĥ             0.15
ioni           0.15
Activations Density 0.165%