INDEX
Explanations
phrases indicating personal responsibility and ownership in experiences
New Auto-Interp
Negative Logits
TintMode
-0.58
彼は
-0.56
ništvo
-0.56
彼女は
-0.51
odkazy
-0.51
Huguen
-0.48
phosa
-0.47
Euphrates
-0.47
они
-0.47
ronique
-0.46
POSITIVE LOGITS
what
1.31
how
1.24
whatever
1.24
everything
1.16
whatever
1.04
everything
1.04
AssemblyCulture
0.98
where
0.98
why
0.97
wherever
0.95
Activations Density 0.459%