INDEX
Explanations
phrases starting with "We" and referring to collective experiences or actions
repeated use of the word "we" indicating a collective or group perspective
New Auto-Interp
Negative Logits
forms
-0.76
¿½
-0.62
persists
-0.61
states
-0.60
arises
-0.57
amide
-0.56
entails
-0.56
ocl
-0.56
cum
-0.55
constitutes
-0.54
POSITIVE LOGITS
're
1.36
ird
1.21
asel
1.10
've
1.08
athered
1.08
weren
1.07
bsite
1.03
eks
1.02
ighed
1.01
'll
1.01
Activations Density 0.208%