INDEX
Explanations
instances of conflict and competition as described in various contexts
New Auto-Interp
Negative Logits
rather
-0.16
ury
-0.15
gere
-0.15
ær
-0.15
iteli
-0.14
æŃ£
-0.14
rather
-0.14
именно
-0.14
вмеÑģÑĤ
-0.14
acky
-0.14
POSITIVE LOGITS
whereas
0.23
Whereas
0.22
merely
0.18
typically
0.17
usually
0.15
tends
0.15
Typically
0.15
endant
0.15
typically
0.14
shi
0.14
Activations Density 0.166%