INDEX
Explanations
References to user experiences and interactions with products or services
People taking specific actions or having certain preferences
States of subjects
Negative Logits
plupart    -0.85
maioria    -0.84
anyone     -0.83
każdej     -0.83
fleste     -0.83
każdy      -0.83
どれも     -0.82
flesta     -0.82
anyone     -0.81
anybody    -0.81
Positive Logits
addirittura    0.92
outright       0.88
may            0.88
chiar          0.82
downright      0.82
甚至是         0.81
sogar          0.79
even           0.78
simply         0.77
might          0.75
Activations Density 0.553%