Explanations
References to specific brands or products, particularly in the context of consumer behavior or marketing strategies.
Negative Logits
Token   Logit
she     -0.17
они     -0.16
they    -0.15
ils     -0.15
其      -0.15
ike     -0.15
wan     -0.15
rix     -0.14
imits   -0.13
ecute   -0.13
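The non-Latin entry in the list above can show up in a byte-level form (the three characters åħ¶) instead of the character it encodes (其). The following is a minimal sketch of how such a displayed token can be decoded, assuming the tokens come from a GPT-2-style byte-level BPE vocabulary. That is an assumption, since the source does not name the tokenizer, and the helper names `bytes_to_unicode` and `decode_display_token` are illustrative only.

```python
def bytes_to_unicode():
    """Rebuild the byte -> printable-character table used by GPT-2-style
    byte-level BPE vocabularies (assumed tokenizer, not stated in the source)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at or above 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


def decode_display_token(shown):
    """Map each displayed character back to its byte, then decode as UTF-8."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in shown)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_display_token("åħ¶"))
```

Under this assumption the displayed string corresponds to the bytes E5 85 B6, the UTF-8 encoding of 其, a Chinese third-person or possessive pronoun, which fits the other pronouns in the list.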
Positive Logits
Token   Logit
it      0.45
It      0.33
It      0.33
it      0.28
_it     0.28
,it     0.27
It      0.25
(it     0.23
.It     0.23
-it     0.22
Activations Density 0.338%
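
The density figure above is commonly read as the fraction of sampled tokens on which the feature fires. The sketch below assumes that definition (the source does not define it) and uses toy activation values that are not taken from this feature. The function name `activation_density` and the `threshold` parameter are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: density = (number of active tokens) / (total tokens).
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


if __name__ == "__main__":
    # Toy data: 3 active tokens out of 1000 sampled tokens.
    toy = [0.0] * 997 + [1.2, 0.4, 2.0]
    print(f"{activation_density(toy):.3%}")
```

On this reading, a density of 0.338% would mean the feature is active on roughly 3 to 4 of every 1000 sampled tokens.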