INDEX
Explanations
the repetition of the word "we," implying a focus on collective experiences or actions
New Auto-Interp
Negative Logits
WithIOException
-0.80
Thon
-0.75
Paar
-0.73
Padang
-0.69
Salat
-0.69
Monfieur
-0.68
Chy
-0.68
Ueber
-0.67
Rés
-0.67
Chy
-0.67
POSITIVE LOGITS
We
1.55
We
1.52
we
1.48
we
1.33
WE
1.14
I
1.08
I
1.06
Weinstein
1.05
weevil
1.04
they
1.02
Activations Density 0.189%