INDEX
Explanations
personal pronouns containing the letter 'I'
references to personal experiences or statements
Negative Logits
Mehran          -0.78
pires           -0.66
龍契士          -0.62
Contents        -0.61
sacks           -0.61
advertisement   -0.59
rules           -0.58
steps           -0.58
tools           -0.58
comes           -0.58
Positive Logits
dunno           1.64
suppose         1.46
guess           1.43
presume         1.33
'm              1.31
assume          1.20
've             1.17
haven           1.15
think           1.14
wonder          1.13
Activations Density: 0.188%