INDEX
Explanations
information related to personal experiences and activities
sentences detailing personal development and experiences
New Auto-Interp
Negative Logits
Plaint
-0.74
yourselves
-0.71
Their
-0.69
plaintiffs
-0.65
Enlarge
-0.64
Phelps
-0.64
arrell
-0.63
deputies
-0.62
Pelosi
-0.62
themselves
-0.61
POSITIVE LOGITS
blogging
1.12
myself
1.09
browsing
0.92
linux
0.91
coding
0.90
Patreon
0.90
my
0.89
experimenting
0.88
writing
0.88
compile
0.87
Activations Density 0.704%