INDEX
Explanations
statements about existence, being, and the nature of self
New Auto-Interp
Negative Logits
GoPro
-0.84
multiple
-0.81
trending
-0.76
ICO
-0.75
CTR
-0.75
impact
-0.74
boost
-0.74
Crunch
-0.73
Ops
-0.72
marquee
-0.72
POSITIVE LOGITS
confess
1.06
pity
1.06
consec
1.05
pious
1.05
vain
1.01
theolog
1.01
philosophers
0.98
divine
0.97
blasp
0.97
scarcely
0.96
Activations Density 0.413%