INDEX
Explanations
phrases related to struggles and experiences in life
Negative Logits
.tables     -0.15
á»ģ         -0.15
ifs         -0.14
idak        -0.14
ewis        -0.13
ewood       -0.13
icom        -0.13
loh         -0.13
flater      -0.13
çļĦè¯Ŀ      -0.13
Positive Logits
thinking    0.54
hoping      0.52
knowing     0.45
thinking    0.44
believing   0.39
expecting   0.37
hop         0.36
Thinking    0.36
Thinking    0.35
feeling     0.35
Activations Density 1.488%