INDEX
Explanations
messages related to personal development and empowerment
motivational language focused on self-improvement and personal agency
New Auto-Interp
Negative Logits
—"
-1.02
ãĥīãĥ©
-0.86
Enlarge
-0.84
"—
-0.73
)—
-0.68
"]
-0.68
];
-0.67
Å«
-0.66
Åį
-0.66
âĢķ
-0.65
POSITIVE LOGITS
thats
1.25
alot
1.24
ie
1.21
but
1.18
haha
1.18
lol
1.15
whereas
1.15
tho
1.12
BUT
1.12
doesnt
1.10
Activations Density 1.496%