INDEX
Explanations
sentences ending with a full stop
complete sentences
New Auto-Interp
Negative Logits
¥ŀ
-0.84
volunte
-0.77
nodd
-0.73
prosec
-0.72
encount
-0.71
suspic
-0.70
confir
-0.68
rall
-0.67
yip
-0.67
advoc
-0.66
POSITIVE LOGITS
My
1.71
My
1.60
I
1.56
I
1.43
my
1.31
myself
1.21
Whenever
1.17
Somehow
1.17
my
1.16
MY
1.15
Activations Density 0.545%