INDEX
Explanations
instances of conversational phrases or conditional statements in discussions about relationships
New Auto-Interp
Negative Logits
aur
-0.15
ebin
-0.15
whereas
-0.14
æīį
-0.14
rud
-0.14
abaj
-0.14
Whereas
-0.14
urat
-0.13
Prec
-0.13
probably
-0.13
POSITIVE LOGITS
varsa
0.18
_______,
0.17
ROKE
0.16
>",
0.15
rag
0.15
nosti
0.14
quieres
0.14
roke
0.14
yoksa
0.14
@",
0.14
Activations Density 0.105%