INDEX
Explanations
positive affirmations and phrases that encourage self-reflection and growth
Negative Logits
Writers     -0.16
ãĤ°ãĥ©      -0.16
æ¬          -0.15
iasi        -0.15
habi        -0.15
má»         -0.14
loit        -0.14
979         -0.14
Writing     -0.13
cxx         -0.13
Positive Logits
rec         0.52
utter       0.35
speaking    0.31
speak       0.29
utter       0.28
delivering  0.28
Rec         0.28
rec         0.27
pron        0.26
delivery    0.26
Activations Density 0.482%